Data Exploration

data <- read_csv("prosperLoanData.csv") %>% mutate_if(is.character, as.factor)

Data Cleaning

Above, I imported the character columns as factors, as having taken a closer look at the data, they are labels for categories, rather than strings (in the following analysis, I don’t find any disconfirmation of this). The first thing I will do now is take a closer look at the data, and see if other columns are formatted appropriately:

data[,1:7]
## # A tibble: 113,937 x 7
##    ListingKey          ListingNumber ListingCreationDate CreditGrade  Term
##    <fct>                       <int> <dttm>              <fct>       <int>
##  1 102133976686814541…        193129 2007-08-26 19:09:29 C              36
##  2 10273602499503308B…       1209647 2014-02-27 08:28:07 <NA>           36
##  3 0EE933782585103286…         81716 2007-01-05 15:00:47 HR             36
##  4 0EF535600248271529…        658116 2012-10-22 11:02:35 <NA>           36
##  5 0F023589499656230C…        909464 2013-09-14 18:38:39 <NA>           36
##  6 0F0535973482419938…       1074836 2013-12-14 08:26:37 <NA>           60
##  7 0F0A3576754255009D…        750899 2013-04-12 09:52:56 <NA>           36
##  8 0F1035772717087366…        768193 2013-05-05 06:49:27 <NA>           36
##  9 0F043596202561788E…       1023355 2013-12-02 10:43:39 <NA>           36
## 10 0F043596202561788E…       1023355 2013-12-02 10:43:39 <NA>           36
## # ... with 113,927 more rows, and 2 more variables: LoanStatus <fct>,
## #   ClosedDate <dttm>
str(data)
## Classes 'tbl_df', 'tbl' and 'data.frame':    113937 obs. of  81 variables:
##  $ ListingKey                         : Factor w/ 113066 levels "00003546482094282EF90E5",..: 7180 7193 6647 6669 6686 6689 6699 6706 6687 6687 ...
##  $ ListingNumber                      : int  193129 1209647 81716 658116 909464 1074836 750899 768193 1023355 1023355 ...
##  $ ListingCreationDate                : POSIXct, format: "2007-08-26 19:09:29" "2014-02-27 08:28:07" ...
##  $ CreditGrade                        : Factor w/ 8 levels "A","AA","B","C",..: 4 NA 7 NA NA NA NA NA NA NA ...
##  $ Term                               : int  36 36 36 36 36 60 36 36 36 36 ...
##  $ LoanStatus                         : Factor w/ 12 levels "Cancelled","Chargedoff",..: 3 4 3 4 4 4 4 4 4 4 ...
##  $ ClosedDate                         : POSIXct, format: "2009-08-14" NA ...
##  $ BorrowerAPR                        : num  0.165 0.12 0.283 0.125 0.246 ...
##  $ BorrowerRate                       : num  0.158 0.092 0.275 0.0974 0.2085 ...
##  $ LenderYield                        : num  0.138 0.082 0.24 0.0874 0.1985 ...
##  $ EstimatedEffectiveYield            : num  NA 0.0796 NA 0.0849 0.1832 ...
##  $ EstimatedLoss                      : num  NA 0.0249 NA 0.0249 0.0925 ...
##  $ EstimatedReturn                    : num  NA 0.0547 NA 0.06 0.0907 ...
##  $ ProsperRating (numeric)            : int  NA 6 NA 6 3 5 2 4 7 7 ...
##  $ ProsperRating (Alpha)              : Factor w/ 7 levels "A","AA","B","C",..: NA 1 NA 1 5 3 6 4 2 2 ...
##  $ ProsperScore                       : num  NA 7 NA 9 4 10 2 4 9 11 ...
##  $ ListingCategory (numeric)          : int  0 2 0 16 2 1 1 2 7 7 ...
##  $ BorrowerState                      : Factor w/ 51 levels "AK","AL","AR",..: 6 6 11 11 24 33 17 5 15 15 ...
##  $ Occupation                         : Factor w/ 67 levels "Accountant/CPA",..: 36 42 36 51 20 42 49 28 23 23 ...
##  $ EmploymentStatus                   : Factor w/ 8 levels "Employed","Full-time",..: 8 1 3 1 1 1 1 1 1 1 ...
##  $ EmploymentStatusDuration           : int  2 44 NA 113 44 82 172 103 269 269 ...
##  $ IsBorrowerHomeowner                : Factor w/ 2 levels "False","True": 2 1 1 2 2 2 1 1 2 2 ...
##  $ CurrentlyInGroup                   : Factor w/ 2 levels "False","True": 2 1 2 1 1 1 1 1 1 1 ...
##  $ GroupKey                           : Factor w/ 706 levels "00343376901312423168731",..: NA NA 334 NA NA NA NA NA NA NA ...
##  $ DateCreditPulled                   : POSIXct, format: "2007-08-26 18:41:46" "2014-02-27 08:28:14" ...
##  $ CreditScoreRangeLower              : int  640 680 480 800 680 740 680 700 820 820 ...
##  $ CreditScoreRangeUpper              : int  659 699 499 819 699 759 699 719 839 839 ...
##  $ FirstRecordedCreditLine            : POSIXct, format: "2001-10-11" "1996-03-18" ...
##  $ CurrentCreditLines                 : int  5 14 NA 5 19 21 10 6 17 17 ...
##  $ OpenCreditLines                    : int  4 14 NA 5 19 17 7 6 16 16 ...
##  $ TotalCreditLinespast7years         : int  12 29 3 29 49 49 20 10 32 32 ...
##  $ OpenRevolvingAccounts              : int  1 13 0 7 6 13 6 5 12 12 ...
##  $ OpenRevolvingMonthlyPayment        : num  24 389 0 115 220 1410 214 101 219 219 ...
##  $ InquiriesLast6Months               : int  3 3 0 0 1 0 0 3 1 1 ...
##  $ TotalInquiries                     : num  3 5 1 1 9 2 0 16 6 6 ...
##  $ CurrentDelinquencies               : int  2 0 1 4 0 0 0 0 0 0 ...
##  $ AmountDelinquent                   : num  472 0 NA 10056 0 ...
##  $ DelinquenciesLast7Years            : int  4 0 0 14 0 0 0 0 0 0 ...
##  $ PublicRecordsLast10Years           : int  0 1 0 0 0 0 0 1 0 0 ...
##  $ PublicRecordsLast12Months          : int  0 0 NA 0 0 0 0 0 0 0 ...
##  $ RevolvingCreditBalance             : num  0 3989 NA 1444 6193 ...
##  $ BankcardUtilization                : num  0 0.21 NA 0.04 0.81 0.39 0.72 0.13 0.11 0.11 ...
##  $ AvailableBankcardCredit            : num  1500 10266 NA 30754 695 ...
##  $ TotalTrades                        : num  11 29 NA 26 39 47 16 10 29 29 ...
##  $ TradesNeverDelinquent (percentage) : num  0.81 1 NA 0.76 0.95 1 0.68 0.8 1 1 ...
##  $ TradesOpenedLast6Months            : num  0 2 NA 0 2 0 0 0 1 1 ...
##  $ DebtToIncomeRatio                  : num  0.17 0.18 0.06 0.15 0.26 0.36 0.27 0.24 0.25 0.25 ...
##  $ IncomeRange                        : Factor w/ 8 levels "$0","$1-24,999",..: 4 5 7 4 3 3 4 4 4 4 ...
##  $ IncomeVerifiable                   : Factor w/ 2 levels "False","True": 2 2 2 2 2 2 2 2 2 2 ...
##  $ StatedMonthlyIncome                : num  3083 6125 2083 2875 9583 ...
##  $ LoanKey                            : Factor w/ 113066 levels "00003683605746079487FF7",..: 100337 69837 46303 70776 71387 86505 91250 5425 908 908 ...
##  $ TotalProsperLoans                  : int  NA NA NA NA 1 NA NA NA NA NA ...
##  $ TotalProsperPaymentsBilled         : int  NA NA NA NA 11 NA NA NA NA NA ...
##  $ OnTimeProsperPayments              : int  NA NA NA NA 11 NA NA NA NA NA ...
##  $ ProsperPaymentsLessThanOneMonthLate: int  NA NA NA NA 0 NA NA NA NA NA ...
##  $ ProsperPaymentsOneMonthPlusLate    : int  NA NA NA NA 0 NA NA NA NA NA ...
##  $ ProsperPrincipalBorrowed           : num  NA NA NA NA 11000 NA NA NA NA NA ...
##  $ ProsperPrincipalOutstanding        : num  NA NA NA NA 9948 ...
##  $ ScorexChangeAtTimeOfListing        : int  NA NA NA NA NA NA NA NA NA NA ...
##  $ LoanCurrentDaysDelinquent          : int  0 0 0 0 0 0 0 0 0 0 ...
##  $ LoanFirstDefaultedCycleNumber      : int  NA NA NA NA NA NA NA NA NA NA ...
##  $ LoanMonthsSinceOrigination         : int  78 0 86 16 6 3 11 10 3 3 ...
##  $ LoanNumber                         : int  19141 134815 6466 77296 102670 123257 88353 90051 121268 121268 ...
##  $ LoanOriginalAmount                 : int  9425 10000 3001 10000 15000 15000 3000 10000 10000 10000 ...
##  $ LoanOriginationDate                : POSIXct, format: "2007-09-12" "2014-03-03" ...
##  $ LoanOriginationQuarter             : Factor w/ 33 levels "Q1 2006","Q1 2007",..: 18 8 2 32 24 33 16 16 33 33 ...
##  $ MemberKey                          : Factor w/ 90831 levels "00003397697413387CAF966",..: 11071 10302 33781 54939 19465 48037 60448 40951 26129 26129 ...
##  $ MonthlyLoanPayment                 : num  330 319 123 321 564 ...
##  $ LP_CustomerPayments                : num  11396 0 4187 5143 2820 ...
##  $ LP_CustomerPrincipalPayments       : num  9425 0 3001 4091 1563 ...
##  $ LP_InterestandFees                 : num  1971 0 1186 1052 1257 ...
##  $ LP_ServiceFees                     : num  -133.2 0 -24.2 -108 -60.3 ...
##  $ LP_CollectionFees                  : num  0 0 0 0 0 0 0 0 0 0 ...
##  $ LP_GrossPrincipalLoss              : num  0 0 0 0 0 0 0 0 0 0 ...
##  $ LP_NetPrincipalLoss                : num  0 0 0 0 0 0 0 0 0 0 ...
##  $ LP_NonPrincipalRecoverypayments    : num  0 0 0 0 0 0 0 0 0 0 ...
##  $ PercentFunded                      : num  1 1 1 1 1 1 1 1 1 1 ...
##  $ Recommendations                    : int  0 0 0 0 0 0 0 0 0 0 ...
##  $ InvestmentFromFriendsCount         : int  0 0 0 0 0 0 0 0 0 0 ...
##  $ InvestmentFromFriendsAmount        : num  0 0 0 0 0 0 0 0 0 0 ...
##  $ Investors                          : int  258 1 41 158 20 1 1 1 1 1 ...

The first thing I notice is that there are several date columns which should be formatted as such, and several boolean (True/False) type columns. ListingCategory.num is actually a category label, not a numeric measure. I also want to order the levels in some of the factor columns, as they are inherently ordered (CreditGrade, ProsperRating.alpha, IncomeRange, LoanOriginationQuarter). Several of the columns have spaces or special characters in the column names, which makes it difficult to refer to these columns - I will rename these.

data %<>% 
  mutate_at(c("ListingCreationDate","ClosedDate","DateCreditPulled","FirstRecordedCreditLine","LoanOriginationDate"), as.Date) %>%
  mutate_at(c("IsBorrowerHomeowner","CurrentlyInGroup","IncomeVerifiable"), as.logical) %>%
  rename_all(~sub(" (numeric)", ".num", ., fixed=TRUE)) %>%
  rename_all(~sub(" (Alpha)", ".alpha", ., fixed=TRUE)) %>%
  rename_all(~sub(" (percentage)", ".per", ., fixed=TRUE)) %>%
  mutate_at("ListingCategory.num", as.factor)

data$CreditGrade <- ordered(data$CreditGrade, c("NC","HR","E","D","C","B","A","AA"))
data$ProsperRating.alpha <- ordered(data$ProsperRating.alpha, c("NC","HR","E","D","C","B","A","AA"))
data$IncomeRange <- ordered(data$IncomeRange, c("Not displayed","Not employed","$0","$1-24,999","$25,000-49,999","$50,000-74,999","$75,000-99,999","$100,000+"))
data$LoanOriginationQuarter <- ordered(data$LoanOriginationQuarter, c("Q1 2006", "Q2 2006", "Q3 2006", "Q4 2006", "Q1 2007", "Q2 2007", "Q3 2007", "Q4 2007", "Q1 2008", "Q2 2008", "Q3 2008", "Q4 2008", "Q1 2009", "Q2 2009", "Q3 2009", "Q4 2009", "Q1 2010", "Q2 2010", "Q3 2010", "Q4 2010", "Q1 2011", "Q2 2011", "Q3 2011", "Q4 2011", "Q1 2012", "Q2 2012", "Q3 2012", "Q4 2012", "Q1 2013", "Q2 2013", "Q3 2013", "Q4 2013", "Q1 2014", "Q2 2014", "Q3 2014", "Q4 2014"))
str(data)
## Classes 'tbl_df', 'tbl' and 'data.frame':    113937 obs. of  81 variables:
##  $ ListingKey                         : Factor w/ 113066 levels "00003546482094282EF90E5",..: 7180 7193 6647 6669 6686 6689 6699 6706 6687 6687 ...
##  $ ListingNumber                      : int  193129 1209647 81716 658116 909464 1074836 750899 768193 1023355 1023355 ...
##  $ ListingCreationDate                : Date, format: "2007-08-26" "2014-02-27" ...
##  $ CreditGrade                        : Ord.factor w/ 8 levels "NC"<"HR"<"E"<..: 5 NA 2 NA NA NA NA NA NA NA ...
##  $ Term                               : int  36 36 36 36 36 60 36 36 36 36 ...
##  $ LoanStatus                         : Factor w/ 12 levels "Cancelled","Chargedoff",..: 3 4 3 4 4 4 4 4 4 4 ...
##  $ ClosedDate                         : Date, format: "2009-08-14" NA ...
##  $ BorrowerAPR                        : num  0.165 0.12 0.283 0.125 0.246 ...
##  $ BorrowerRate                       : num  0.158 0.092 0.275 0.0974 0.2085 ...
##  $ LenderYield                        : num  0.138 0.082 0.24 0.0874 0.1985 ...
##  $ EstimatedEffectiveYield            : num  NA 0.0796 NA 0.0849 0.1832 ...
##  $ EstimatedLoss                      : num  NA 0.0249 NA 0.0249 0.0925 ...
##  $ EstimatedReturn                    : num  NA 0.0547 NA 0.06 0.0907 ...
##  $ ProsperRating.num                  : int  NA 6 NA 6 3 5 2 4 7 7 ...
##  $ ProsperRating.alpha                : Ord.factor w/ 8 levels "NC"<"HR"<"E"<..: NA 7 NA 7 4 6 3 5 8 8 ...
##  $ ProsperScore                       : num  NA 7 NA 9 4 10 2 4 9 11 ...
##  $ ListingCategory.num                : Factor w/ 21 levels "0","1","2","3",..: 1 3 1 17 3 2 2 3 8 8 ...
##  $ BorrowerState                      : Factor w/ 51 levels "AK","AL","AR",..: 6 6 11 11 24 33 17 5 15 15 ...
##  $ Occupation                         : Factor w/ 67 levels "Accountant/CPA",..: 36 42 36 51 20 42 49 28 23 23 ...
##  $ EmploymentStatus                   : Factor w/ 8 levels "Employed","Full-time",..: 8 1 3 1 1 1 1 1 1 1 ...
##  $ EmploymentStatusDuration           : int  2 44 NA 113 44 82 172 103 269 269 ...
##  $ IsBorrowerHomeowner                : logi  TRUE FALSE FALSE TRUE TRUE TRUE ...
##  $ CurrentlyInGroup                   : logi  TRUE FALSE TRUE FALSE FALSE FALSE ...
##  $ GroupKey                           : Factor w/ 706 levels "00343376901312423168731",..: NA NA 334 NA NA NA NA NA NA NA ...
##  $ DateCreditPulled                   : Date, format: "2007-08-26" "2014-02-27" ...
##  $ CreditScoreRangeLower              : int  640 680 480 800 680 740 680 700 820 820 ...
##  $ CreditScoreRangeUpper              : int  659 699 499 819 699 759 699 719 839 839 ...
##  $ FirstRecordedCreditLine            : Date, format: "2001-10-11" "1996-03-18" ...
##  $ CurrentCreditLines                 : int  5 14 NA 5 19 21 10 6 17 17 ...
##  $ OpenCreditLines                    : int  4 14 NA 5 19 17 7 6 16 16 ...
##  $ TotalCreditLinespast7years         : int  12 29 3 29 49 49 20 10 32 32 ...
##  $ OpenRevolvingAccounts              : int  1 13 0 7 6 13 6 5 12 12 ...
##  $ OpenRevolvingMonthlyPayment        : num  24 389 0 115 220 1410 214 101 219 219 ...
##  $ InquiriesLast6Months               : int  3 3 0 0 1 0 0 3 1 1 ...
##  $ TotalInquiries                     : num  3 5 1 1 9 2 0 16 6 6 ...
##  $ CurrentDelinquencies               : int  2 0 1 4 0 0 0 0 0 0 ...
##  $ AmountDelinquent                   : num  472 0 NA 10056 0 ...
##  $ DelinquenciesLast7Years            : int  4 0 0 14 0 0 0 0 0 0 ...
##  $ PublicRecordsLast10Years           : int  0 1 0 0 0 0 0 1 0 0 ...
##  $ PublicRecordsLast12Months          : int  0 0 NA 0 0 0 0 0 0 0 ...
##  $ RevolvingCreditBalance             : num  0 3989 NA 1444 6193 ...
##  $ BankcardUtilization                : num  0 0.21 NA 0.04 0.81 0.39 0.72 0.13 0.11 0.11 ...
##  $ AvailableBankcardCredit            : num  1500 10266 NA 30754 695 ...
##  $ TotalTrades                        : num  11 29 NA 26 39 47 16 10 29 29 ...
##  $ TradesNeverDelinquent.per          : num  0.81 1 NA 0.76 0.95 1 0.68 0.8 1 1 ...
##  $ TradesOpenedLast6Months            : num  0 2 NA 0 2 0 0 0 1 1 ...
##  $ DebtToIncomeRatio                  : num  0.17 0.18 0.06 0.15 0.26 0.36 0.27 0.24 0.25 0.25 ...
##  $ IncomeRange                        : Ord.factor w/ 8 levels "Not displayed"<..: 5 6 1 5 8 8 5 5 5 5 ...
##  $ IncomeVerifiable                   : logi  TRUE TRUE TRUE TRUE TRUE TRUE ...
##  $ StatedMonthlyIncome                : num  3083 6125 2083 2875 9583 ...
##  $ LoanKey                            : Factor w/ 113066 levels "00003683605746079487FF7",..: 100337 69837 46303 70776 71387 86505 91250 5425 908 908 ...
##  $ TotalProsperLoans                  : int  NA NA NA NA 1 NA NA NA NA NA ...
##  $ TotalProsperPaymentsBilled         : int  NA NA NA NA 11 NA NA NA NA NA ...
##  $ OnTimeProsperPayments              : int  NA NA NA NA 11 NA NA NA NA NA ...
##  $ ProsperPaymentsLessThanOneMonthLate: int  NA NA NA NA 0 NA NA NA NA NA ...
##  $ ProsperPaymentsOneMonthPlusLate    : int  NA NA NA NA 0 NA NA NA NA NA ...
##  $ ProsperPrincipalBorrowed           : num  NA NA NA NA 11000 NA NA NA NA NA ...
##  $ ProsperPrincipalOutstanding        : num  NA NA NA NA 9948 ...
##  $ ScorexChangeAtTimeOfListing        : int  NA NA NA NA NA NA NA NA NA NA ...
##  $ LoanCurrentDaysDelinquent          : int  0 0 0 0 0 0 0 0 0 0 ...
##  $ LoanFirstDefaultedCycleNumber      : int  NA NA NA NA NA NA NA NA NA NA ...
##  $ LoanMonthsSinceOrigination         : int  78 0 86 16 6 3 11 10 3 3 ...
##  $ LoanNumber                         : int  19141 134815 6466 77296 102670 123257 88353 90051 121268 121268 ...
##  $ LoanOriginalAmount                 : int  9425 10000 3001 10000 15000 15000 3000 10000 10000 10000 ...
##  $ LoanOriginationDate                : Date, format: "2007-09-12" "2014-03-03" ...
##  $ LoanOriginationQuarter             : Ord.factor w/ 36 levels "Q1 2006"<"Q2 2006"<..: 7 33 5 28 31 32 30 30 32 32 ...
##  $ MemberKey                          : Factor w/ 90831 levels "00003397697413387CAF966",..: 11071 10302 33781 54939 19465 48037 60448 40951 26129 26129 ...
##  $ MonthlyLoanPayment                 : num  330 319 123 321 564 ...
##  $ LP_CustomerPayments                : num  11396 0 4187 5143 2820 ...
##  $ LP_CustomerPrincipalPayments       : num  9425 0 3001 4091 1563 ...
##  $ LP_InterestandFees                 : num  1971 0 1186 1052 1257 ...
##  $ LP_ServiceFees                     : num  -133.2 0 -24.2 -108 -60.3 ...
##  $ LP_CollectionFees                  : num  0 0 0 0 0 0 0 0 0 0 ...
##  $ LP_GrossPrincipalLoss              : num  0 0 0 0 0 0 0 0 0 0 ...
##  $ LP_NetPrincipalLoss                : num  0 0 0 0 0 0 0 0 0 0 ...
##  $ LP_NonPrincipalRecoverypayments    : num  0 0 0 0 0 0 0 0 0 0 ...
##  $ PercentFunded                      : num  1 1 1 1 1 1 1 1 1 1 ...
##  $ Recommendations                    : int  0 0 0 0 0 0 0 0 0 0 ...
##  $ InvestmentFromFriendsCount         : int  0 0 0 0 0 0 0 0 0 0 ...
##  $ InvestmentFromFriendsAmount        : num  0 0 0 0 0 0 0 0 0 0 ...
##  $ Investors                          : int  258 1 41 158 20 1 1 1 1 1 ...

First Impressions

Now I want to take a look at a summary of the data, to try to figure out what might be going on:

summary(data)
##                    ListingKey     ListingNumber     ListingCreationDate 
##  17A93590655669644DB4C06:     6   Min.   :      4   Min.   :2005-11-09  
##  349D3587495831350F0F648:     4   1st Qu.: 400919   1st Qu.:2008-09-19  
##  47C1359638497431975670B:     4   Median : 600554   Median :2012-06-16  
##  8474358854651984137201C:     4   Mean   : 627886   Mean   :2011-07-08  
##  DE8535960513435199406CE:     4   3rd Qu.: 892634   3rd Qu.:2013-09-09  
##  04C13599434217079754AEE:     3   Max.   :1255725   Max.   :2014-03-10  
##  (Other)                :113912                                         
##   CreditGrade         Term                       LoanStatus   
##  C      : 5649   Min.   :12.00   Current              :56576  
##  D      : 5153   1st Qu.:36.00   Completed            :38074  
##  B      : 4389   Median :36.00   Chargedoff           :11992  
##  AA     : 3509   Mean   :40.83   Defaulted            : 5018  
##  HR     : 3508   3rd Qu.:36.00   Past Due (1-15 days) :  806  
##  (Other): 6745   Max.   :60.00   Past Due (31-60 days):  363  
##  NA's   :84984                   (Other)              : 1108  
##    ClosedDate          BorrowerAPR       BorrowerRate     LenderYield     
##  Min.   :2005-11-25   Min.   :0.00653   Min.   :0.0000   Min.   :-0.0100  
##  1st Qu.:2009-07-14   1st Qu.:0.15629   1st Qu.:0.1340   1st Qu.: 0.1242  
##  Median :2011-04-05   Median :0.20976   Median :0.1840   Median : 0.1730  
##  Mean   :2011-03-07   Mean   :0.21883   Mean   :0.1928   Mean   : 0.1827  
##  3rd Qu.:2013-01-30   3rd Qu.:0.28381   3rd Qu.:0.2500   3rd Qu.: 0.2400  
##  Max.   :2014-03-10   Max.   :0.51229   Max.   :0.4975   Max.   : 0.4925  
##  NA's   :58848        NA's   :25                                          
##  EstimatedEffectiveYield EstimatedLoss   EstimatedReturn 
##  Min.   :-0.183          Min.   :0.005   Min.   :-0.183  
##  1st Qu.: 0.116          1st Qu.:0.042   1st Qu.: 0.074  
##  Median : 0.162          Median :0.072   Median : 0.092  
##  Mean   : 0.169          Mean   :0.080   Mean   : 0.096  
##  3rd Qu.: 0.224          3rd Qu.:0.112   3rd Qu.: 0.117  
##  Max.   : 0.320          Max.   :0.366   Max.   : 0.284  
##  NA's   :29084           NA's   :29084   NA's   :29084   
##  ProsperRating.num ProsperRating.alpha  ProsperScore   ListingCategory.num
##  Min.   :1.000     C      :18345       Min.   : 1.00   1      :58308      
##  1st Qu.:3.000     B      :15581       1st Qu.: 4.00   0      :16965      
##  Median :4.000     A      :14551       Median : 6.00   7      :10494      
##  Mean   :4.072     D      :14274       Mean   : 5.95   2      : 7433      
##  3rd Qu.:5.000     E      : 9795       3rd Qu.: 8.00   3      : 7189      
##  Max.   :7.000     (Other):12307       Max.   :11.00   6      : 2572      
##  NA's   :29084     NA's   :29084       NA's   :29084   (Other):10976      
##  BorrowerState                 Occupation         EmploymentStatus
##  CA     :14717   Other              :28617   Employed     :67322  
##  TX     : 6842   Professional       :13628   Full-time    :26355  
##  NY     : 6729   Computer Programmer: 4478   Self-employed: 6134  
##  FL     : 6720   Executive          : 4311   Not available: 5347  
##  IL     : 5921   Teacher            : 3759   Other        : 3806  
##  (Other):67493   (Other)            :55556   (Other)      : 2718  
##  NA's   : 5515   NA's               : 3588   NA's         : 2255  
##  EmploymentStatusDuration IsBorrowerHomeowner CurrentlyInGroup
##  Min.   :  0.00           Mode :logical       Mode :logical   
##  1st Qu.: 26.00           FALSE:56459         FALSE:101218    
##  Median : 67.00           TRUE :57478         TRUE :12719     
##  Mean   : 96.07                                               
##  3rd Qu.:137.00                                               
##  Max.   :755.00                                               
##  NA's   :7625                                                 
##                     GroupKey      DateCreditPulled    
##  783C3371218786870A73D20:  1140   Min.   :2005-11-09  
##  3D4D3366260257624AB272D:   916   1st Qu.:2008-09-16  
##  6A3B336601725506917317E:   698   Median :2012-06-17  
##  FEF83377364176536637E50:   611   Mean   :2011-07-09  
##  C9643379247860156A00EC0:   342   3rd Qu.:2013-09-11  
##  (Other)                :  9634   Max.   :2014-03-10  
##  NA's                   :100596                       
##  CreditScoreRangeLower CreditScoreRangeUpper FirstRecordedCreditLine
##  Min.   :  0.0         Min.   : 19.0         Min.   :1947-08-24     
##  1st Qu.:660.0         1st Qu.:679.0         1st Qu.:1990-06-01     
##  Median :680.0         Median :699.0         Median :1995-11-01     
##  Mean   :685.6         Mean   :704.6         Mean   :1994-11-17     
##  3rd Qu.:720.0         3rd Qu.:739.0         3rd Qu.:2000-03-14     
##  Max.   :880.0         Max.   :899.0         Max.   :2012-12-22     
##  NA's   :591           NA's   :591           NA's   :697            
##  CurrentCreditLines OpenCreditLines TotalCreditLinespast7years
##  Min.   : 0.00      Min.   : 0.00   Min.   :  2.00            
##  1st Qu.: 7.00      1st Qu.: 6.00   1st Qu.: 17.00            
##  Median :10.00      Median : 9.00   Median : 25.00            
##  Mean   :10.32      Mean   : 9.26   Mean   : 26.75            
##  3rd Qu.:13.00      3rd Qu.:12.00   3rd Qu.: 35.00            
##  Max.   :59.00      Max.   :54.00   Max.   :136.00            
##  NA's   :7604       NA's   :7604    NA's   :697               
##  OpenRevolvingAccounts OpenRevolvingMonthlyPayment InquiriesLast6Months
##  Min.   : 0.00         Min.   :    0.0             Min.   :  0.000     
##  1st Qu.: 4.00         1st Qu.:  114.0             1st Qu.:  0.000     
##  Median : 6.00         Median :  271.0             Median :  1.000     
##  Mean   : 6.97         Mean   :  398.3             Mean   :  1.435     
##  3rd Qu.: 9.00         3rd Qu.:  525.0             3rd Qu.:  2.000     
##  Max.   :51.00         Max.   :14985.0             Max.   :105.000     
##                                                    NA's   :697         
##  TotalInquiries    CurrentDelinquencies AmountDelinquent  
##  Min.   :  0.000   Min.   : 0.0000      Min.   :     0.0  
##  1st Qu.:  2.000   1st Qu.: 0.0000      1st Qu.:     0.0  
##  Median :  4.000   Median : 0.0000      Median :     0.0  
##  Mean   :  5.584   Mean   : 0.5921      Mean   :   984.5  
##  3rd Qu.:  7.000   3rd Qu.: 0.0000      3rd Qu.:     0.0  
##  Max.   :379.000   Max.   :83.0000      Max.   :463881.0  
##  NA's   :1159      NA's   :697          NA's   :7622      
##  DelinquenciesLast7Years PublicRecordsLast10Years
##  Min.   : 0.000          Min.   : 0.0000         
##  1st Qu.: 0.000          1st Qu.: 0.0000         
##  Median : 0.000          Median : 0.0000         
##  Mean   : 4.155          Mean   : 0.3126         
##  3rd Qu.: 3.000          3rd Qu.: 0.0000         
##  Max.   :99.000          Max.   :38.0000         
##  NA's   :990             NA's   :697             
##  PublicRecordsLast12Months RevolvingCreditBalance BankcardUtilization
##  Min.   : 0.000            Min.   :      0        Min.   :0.000      
##  1st Qu.: 0.000            1st Qu.:   3121        1st Qu.:0.310      
##  Median : 0.000            Median :   8549        Median :0.600      
##  Mean   : 0.015            Mean   :  17599        Mean   :0.561      
##  3rd Qu.: 0.000            3rd Qu.:  19521        3rd Qu.:0.840      
##  Max.   :20.000            Max.   :1435667        Max.   :5.950      
##  NA's   :7604              NA's   :7604           NA's   :7604       
##  AvailableBankcardCredit  TotalTrades     TradesNeverDelinquent.per
##  Min.   :     0          Min.   :  0.00   Min.   :0.000            
##  1st Qu.:   880          1st Qu.: 15.00   1st Qu.:0.820            
##  Median :  4100          Median : 22.00   Median :0.940            
##  Mean   : 11210          Mean   : 23.23   Mean   :0.886            
##  3rd Qu.: 13180          3rd Qu.: 30.00   3rd Qu.:1.000            
##  Max.   :646285          Max.   :126.00   Max.   :1.000            
##  NA's   :7544            NA's   :7544     NA's   :7544             
##  TradesOpenedLast6Months DebtToIncomeRatio         IncomeRange   
##  Min.   : 0.000          Min.   : 0.000    $25,000-49,999:32192  
##  1st Qu.: 0.000          1st Qu.: 0.140    $50,000-74,999:31050  
##  Median : 0.000          Median : 0.220    $100,000+     :17337  
##  Mean   : 0.802          Mean   : 0.276    $75,000-99,999:16916  
##  3rd Qu.: 1.000          3rd Qu.: 0.320    Not displayed : 7741  
##  Max.   :20.000          Max.   :10.010    $1-24,999     : 7274  
##  NA's   :7544            NA's   :8554      (Other)       : 1427  
##  IncomeVerifiable StatedMonthlyIncome                    LoanKey      
##  Mode :logical    Min.   :      0     CB1B37030986463208432A1:     6  
##  FALSE:8669       1st Qu.:   3200     2DEE3698211017519D7333F:     4  
##  TRUE :105268     Median :   4667     9F4B37043517554537C364C:     4  
##                   Mean   :   5608     D895370150591392337ED6D:     4  
##                   3rd Qu.:   6825     E6FB37073953690388BC56D:     4  
##                   Max.   :1750003     0D8F37036734373301ED419:     3  
##                                       (Other)                :113912  
##  TotalProsperLoans TotalProsperPaymentsBilled OnTimeProsperPayments
##  Min.   :0.00      Min.   :  0.00             Min.   :  0.00       
##  1st Qu.:1.00      1st Qu.:  9.00             1st Qu.:  9.00       
##  Median :1.00      Median : 16.00             Median : 15.00       
##  Mean   :1.42      Mean   : 22.93             Mean   : 22.27       
##  3rd Qu.:2.00      3rd Qu.: 33.00             3rd Qu.: 32.00       
##  Max.   :8.00      Max.   :141.00             Max.   :141.00       
##  NA's   :91852     NA's   :91852              NA's   :91852        
##  ProsperPaymentsLessThanOneMonthLate ProsperPaymentsOneMonthPlusLate
##  Min.   : 0.00                       Min.   : 0.00                  
##  1st Qu.: 0.00                       1st Qu.: 0.00                  
##  Median : 0.00                       Median : 0.00                  
##  Mean   : 0.61                       Mean   : 0.05                  
##  3rd Qu.: 0.00                       3rd Qu.: 0.00                  
##  Max.   :42.00                       Max.   :21.00                  
##  NA's   :91852                       NA's   :91852                  
##  ProsperPrincipalBorrowed ProsperPrincipalOutstanding
##  Min.   :    0            Min.   :    0              
##  1st Qu.: 3500            1st Qu.:    0              
##  Median : 6000            Median : 1627              
##  Mean   : 8472            Mean   : 2930              
##  3rd Qu.:11000            3rd Qu.: 4127              
##  Max.   :72499            Max.   :23451              
##  NA's   :91852            NA's   :91852              
##  ScorexChangeAtTimeOfListing LoanCurrentDaysDelinquent
##  Min.   :-209.00             Min.   :   0.0           
##  1st Qu.: -35.00             1st Qu.:   0.0           
##  Median :  -3.00             Median :   0.0           
##  Mean   :  -3.22             Mean   : 152.8           
##  3rd Qu.:  25.00             3rd Qu.:   0.0           
##  Max.   : 286.00             Max.   :2704.0           
##  NA's   :95009                                        
##  LoanFirstDefaultedCycleNumber LoanMonthsSinceOrigination   LoanNumber    
##  Min.   : 0.00                 Min.   :  0.0              Min.   :     1  
##  1st Qu.: 9.00                 1st Qu.:  6.0              1st Qu.: 37332  
##  Median :14.00                 Median : 21.0              Median : 68599  
##  Mean   :16.27                 Mean   : 31.9              Mean   : 69444  
##  3rd Qu.:22.00                 3rd Qu.: 65.0              3rd Qu.:101901  
##  Max.   :44.00                 Max.   :100.0              Max.   :136486  
##  NA's   :96985                                                            
##  LoanOriginalAmount LoanOriginationDate  LoanOriginationQuarter
##  Min.   : 1000      Min.   :2005-11-15   Q4 2013:14450         
##  1st Qu.: 4000      1st Qu.:2008-10-02   Q1 2014:12172         
##  Median : 6500      Median :2012-06-26   Q3 2013: 9180         
##  Mean   : 8337      Mean   :2011-07-21   Q2 2013: 7099         
##  3rd Qu.:12000      3rd Qu.:2013-09-18   Q3 2012: 5632         
##  Max.   :35000      Max.   :2014-03-12   (Other):65382         
##                                          NA's   :   22         
##                    MemberKey      MonthlyLoanPayment LP_CustomerPayments
##  63CA34120866140639431C9:     9   Min.   :   0.0     Min.   :   -2.35   
##  16083364744933457E57FB9:     8   1st Qu.: 131.6     1st Qu.: 1005.76   
##  3A2F3380477699707C81385:     8   Median : 217.7     Median : 2583.83   
##  4D9C3403302047712AD0CDD:     8   Mean   : 272.5     Mean   : 4183.08   
##  739C338135235294782AE75:     8   3rd Qu.: 371.6     3rd Qu.: 5548.40   
##  7E1733653050264822FAA3D:     8   Max.   :2251.5     Max.   :40702.39   
##  (Other)                :113888                                         
##  LP_CustomerPrincipalPayments LP_InterestandFees LP_ServiceFees   
##  Min.   :    0.0              Min.   :   -2.35   Min.   :-664.87  
##  1st Qu.:  500.9              1st Qu.:  274.87   1st Qu.: -73.18  
##  Median : 1587.5              Median :  700.84   Median : -34.44  
##  Mean   : 3105.5              Mean   : 1077.54   Mean   : -54.73  
##  3rd Qu.: 4000.0              3rd Qu.: 1458.54   3rd Qu.: -13.92  
##  Max.   :35000.0              Max.   :15617.03   Max.   :  32.06  
##                                                                   
##  LP_CollectionFees  LP_GrossPrincipalLoss LP_NetPrincipalLoss
##  Min.   :-9274.75   Min.   :  -94.2       Min.   : -954.5    
##  1st Qu.:    0.00   1st Qu.:    0.0       1st Qu.:    0.0    
##  Median :    0.00   Median :    0.0       Median :    0.0    
##  Mean   :  -14.24   Mean   :  700.4       Mean   :  681.4    
##  3rd Qu.:    0.00   3rd Qu.:    0.0       3rd Qu.:    0.0    
##  Max.   :    0.00   Max.   :25000.0       Max.   :25000.0    
##                                                              
##  LP_NonPrincipalRecoverypayments PercentFunded    Recommendations   
##  Min.   :    0.00                Min.   :0.7000   Min.   : 0.00000  
##  1st Qu.:    0.00                1st Qu.:1.0000   1st Qu.: 0.00000  
##  Median :    0.00                Median :1.0000   Median : 0.00000  
##  Mean   :   25.14                Mean   :0.9986   Mean   : 0.04803  
##  3rd Qu.:    0.00                3rd Qu.:1.0000   3rd Qu.: 0.00000  
##  Max.   :21117.90                Max.   :1.0125   Max.   :39.00000  
##                                                                     
##  InvestmentFromFriendsCount InvestmentFromFriendsAmount   Investors      
##  Min.   : 0.00000           Min.   :    0.00            Min.   :   1.00  
##  1st Qu.: 0.00000           1st Qu.:    0.00            1st Qu.:   2.00  
##  Median : 0.00000           Median :    0.00            Median :  44.00  
##  Mean   : 0.02346           Mean   :   16.55            Mean   :  80.48  
##  3rd Qu.: 0.00000           3rd Qu.:    0.00            3rd Qu.: 115.00  
##  Max.   :33.00000           Max.   :25000.00            Max.   :1189.00  
## 

I first see that there’s a lot of missing data in many of the columns - it’s not clear to me immediately whether this indicates that the data for those rows is truly missing (but theoretically could have been gathered), or if the information in those columns was simply not applicable to those rows. I will sort this out as I move through the data, but I want to see if some information is, for example, only entered once the loan has been closed or completed. First, though, I will identify the factors of interest.

Factors of Interest

Prosper Loans, through cursory research (https://en.wikipedia.org/wiki/Prosper_Marketplace), appears to be a peer-to-peer lending company. The primary concern of companies is profit, and in this case, as I see no obvious measure of profit to the company itself, I will focus on profit to the lender (the lenders, presumably, keep the company in business). Of course, borrowers likewise keep the company in business, and given the measures collected, it’s possible to at least take a look at how borrower demographics influence loan funding. Variable names are cross-referenced with a document linked from the Kaggle site: https://docs.google.com/spreadsheets/d/1gDyi_L4UvIrLTEC6Wri5nbaMmkGmLQBk-Yx3z0XDEtI/edit#gid=0.

The factors of most interest to lenders, I assume, might be (for example) LoanStatus (whether a loan is in good standing, repaid, or written off, etc.), LenderYield (yield minus servicing fee), EstimatedEffectiveYield (yield minus servicing fee and uncollected interest, and plus late fees) - likely more informative than the preceding, EstimatedReturn (overall estimated return, taking into account both estimated yield, and estimated loss), EstimatedLoss (loss on charge-offs), LoanCurrentDaysDelinquent, LP_GrossPrincipalLoss, and LP_NetPrincipalLoss. These seem most indicative of how much lenders might profit, or lose, from any particular borrower. What the lender should care most about, overall, is the ability to predict whether (or to what degree) a given (current or future) loan will pay off. In some cases, it is unclear from the documentation whether these are predictions assigned by Prosper at the outset, or descriptions of what actually happened during the course of loans. Exploring the data might shed some light on this.

On the other hand, the factors I intuitively expect might be predictive of profit are the following (for example): CreditGrade (credit assigned when the listing went live), ProsperRating (rating assigned when the loan went live), ProsperScore (risk score), EstimatedReturn (predicted difference between estimated effective yield and estimated loss), ListingCategory (what the loan is for), Occupation, EmploymentStatus, EmploymentStatusDuration, IsBorrowerHomeowner, CreditScoreRangeLower/CreditScoreRangeUpper, FirstRecordedCreditLine, CurrentCreditLines, OpenCreditLines, TotalCreditLinespast7years, OpenRevolvingAccounts, OpenRevolvingMonthlyPayment, InquiriesLast6Months, TotalInquiries, CurrentDelinquencies, AmountDelinquent, DelinquenciesLast7Years, PublicRecordsLast10Years, PublicRecordsLast12Months, RevolvingCreditBalance, BankcardUtilization, AvailableBankcardCredit, TotalTrades (number of trade lines ever opened), TradesNeverDelinquent, TradesOpenedLast6Months, DebtToIncomeRatio, IncomeRange, IncomeVerifiable, StatedMonthlyIncome, TotalProsperLoans (prior Prosper loans), TotalProsperPaymentsBilled (presumably, number of payments billed at time of listing), OnTimeProsperPayments (number of on-time payments at time of listing), ProsperPaymentsLessThanOneMonthLate, ProsperPaymentsOneMonthPlusLate, ProsperPrincipalBorrowed (amount borrowed at time of listing), ProsperPrincipalOutstanding (amount outstanding at time of listing), Recommendations (number of recommendations at time of listing), InvestmentFromFriendsCount (number of friends investing), andInvestmentFromFriendsAmount (amount invested by friends), and Investors (total number of investors). There are too many of these categories, and I expect to narrow the list I will look at down to a few, particularly when multiple measures reflect more-or-less the same thing, or don’t show any distinct patterns of correlating with other variables.

With respect to loan funding, some of the same predictors likely also influence loan amounts and borrower funding, as most likely reflected by BorrowerAPR, BorrowerRate, LoanOriginalAmount, MonthlyLoanPayment, Term (the length of the loan), and PercentFunded (although this is likely to not be informative for recently created loans).

The borrowers and loans are primary indexed through the variables MemberKey and LoanNumber. Additional variables for keeping track of loans include LoanOriginationDate and LoanOriginationQuarter. ClosedDate is useful for quickly indexing loans which have been closed, and for which firm conclusions can be drawn as to how much lenders profited.

NA Values

Here I want to double-check why information might be missing (e.g., whether some variables are assigned only once a loan has been closed).

closed <- round(colMeans(is.na(filter(data, !is.na(ClosedDate))))*100,2)
not_closed <- round(colMeans(is.na(filter(data, is.na(ClosedDate))))*100,2)
data.frame(closed, not_closed)
##                                     closed not_closed
## ListingKey                            0.00       0.00
## ListingNumber                         0.00       0.00
## ListingCreationDate                   0.00       0.00
## CreditGrade                          47.44     100.00
## Term                                  0.00       0.00
## LoanStatus                            0.00       0.00
## ClosedDate                            0.00     100.00
## BorrowerAPR                           0.05       0.00
## BorrowerRate                          0.00       0.00
## LenderYield                           0.00       0.00
## EstimatedEffectiveYield              52.79       0.00
## EstimatedLoss                        52.79       0.00
## EstimatedReturn                      52.79       0.00
## ProsperRating.num                    52.79       0.00
## ProsperRating.alpha                  52.79       0.00
## ProsperScore                         52.79       0.00
## ListingCategory.num                   0.00       0.00
## BorrowerState                        10.01       0.00
## Occupation                            4.12       2.24
## EmploymentStatus                      4.09       0.00
## EmploymentStatusDuration             13.82       0.02
## IsBorrowerHomeowner                   0.00       0.00
## CurrentlyInGroup                      0.00       0.00
## GroupKey                             77.00      98.86
## DateCreditPulled                      0.00       0.00
## CreditScoreRangeLower                 1.07       0.00
## CreditScoreRangeUpper                 1.07       0.00
## FirstRecordedCreditLine               1.27       0.00
## CurrentCreditLines                   13.80       0.00
## OpenCreditLines                      13.80       0.00
## TotalCreditLinespast7years            1.27       0.00
## OpenRevolvingAccounts                 0.00       0.00
## OpenRevolvingMonthlyPayment           0.00       0.00
## InquiriesLast6Months                  1.27       0.00
## TotalInquiries                        2.10       0.00
## CurrentDelinquencies                  1.27       0.00
## AmountDelinquent                     13.84       0.00
## DelinquenciesLast7Years               1.80       0.00
## PublicRecordsLast10Years              1.27       0.00
## PublicRecordsLast12Months            13.80       0.00
## RevolvingCreditBalance               13.80       0.00
## BankcardUtilization                  13.80       0.00
## AvailableBankcardCredit              13.69       0.00
## TotalTrades                          13.69       0.00
## TradesNeverDelinquent.per            13.69       0.00
## TradesOpenedLast6Months              13.69       0.00
## DebtToIncomeRatio                     7.68       7.35
## IncomeRange                           0.00       0.00
## IncomeVerifiable                      0.00       0.00
## StatedMonthlyIncome                   0.00       0.00
## LoanKey                               0.00       0.00
## TotalProsperLoans                    80.87      80.38
## TotalProsperPaymentsBilled           80.87      80.38
## OnTimeProsperPayments                80.87      80.38
## ProsperPaymentsLessThanOneMonthLate  80.87      80.38
## ProsperPaymentsOneMonthPlusLate      80.87      80.38
## ProsperPrincipalBorrowed             80.87      80.38
## ProsperPrincipalOutstanding          80.87      80.38
## ScorexChangeAtTimeOfListing          81.05      85.58
## LoanCurrentDaysDelinquent             0.00       0.00
## LoanFirstDefaultedCycleNumber        69.24      99.99
## LoanMonthsSinceOrigination            0.00       0.00
## LoanNumber                            0.00       0.00
## LoanOriginalAmount                    0.00       0.00
## LoanOriginationDate                   0.00       0.00
## LoanOriginationQuarter                0.04       0.00
## MemberKey                             0.00       0.00
## MonthlyLoanPayment                    0.00       0.00
## LP_CustomerPayments                   0.00       0.00
## LP_CustomerPrincipalPayments          0.00       0.00
## LP_InterestandFees                    0.00       0.00
## LP_ServiceFees                        0.00       0.00
## LP_CollectionFees                     0.00       0.00
## LP_GrossPrincipalLoss                 0.00       0.00
## LP_NetPrincipalLoss                   0.00       0.00
## LP_NonPrincipalRecoverypayments       0.00       0.00
## PercentFunded                         0.00       0.00
## Recommendations                       0.00       0.00
## InvestmentFromFriendsCount            0.00       0.00
## InvestmentFromFriendsAmount           0.00       0.00
## Investors                             0.00       0.00

The first thing I notice is that whether a loan is closed, or not, is quite, but in most cases not entirely, predictive of whether missing values are present, or not.

None of the open loans have a credit grade, while about half of the closed loans do. I assume that those which do are post-July 2009 loans, which were never assigned a credit grade.

summary(filter(data, !is.na(ClosedDate) & is.na(CreditGrade)))
##                    ListingKey    ListingNumber     ListingCreationDate 
##  018A360063948152589C8BE:    2   Min.   : 149172   Min.   :2007-06-08  
##  30F435938764424435A1188:    2   1st Qu.: 479472   1st Qu.:2010-10-12  
##  32943590099161153292459:    2   Median : 529900   Median :2011-09-28  
##  6DFC3591891372387BB41B2:    2   Mean   : 554859   Mean   :2011-08-17  
##  778D35919242972923313E0:    2   3rd Qu.: 600118   3rd Qu.:2012-06-14  
##  82FD35914405776692938D4:    2   Max.   :1204824   Max.   :2014-02-13  
##  (Other)                :26124                                         
##   CreditGrade         Term                        LoanStatus   
##  NC     :    0   Min.   :12.00   Completed             :19786  
##  HR     :    0   1st Qu.:36.00   Chargedoff            : 5342  
##  E      :    0   Median :36.00   Defaulted             : 1008  
##  D      :    0   Mean   :37.99   Cancelled             :    0  
##  C      :    0   3rd Qu.:36.00   Current               :    0  
##  (Other):    0   Max.   :60.00   FinalPaymentInProgress:    0  
##  NA's   :26136                   (Other)               :    0  
##    ClosedDate          BorrowerAPR       BorrowerRate     LenderYield    
##  Min.   :2009-08-27   Min.   :0.04583   Min.   :0.0400   Min.   :0.0300  
##  1st Qu.:2012-06-12   1st Qu.:0.17359   1st Qu.:0.1469   1st Qu.:0.1369  
##  Median :2013-02-20   Median :0.26798   Median :0.2300   Median :0.2200  
##  Mean   :2012-12-20   Mean   :0.25118   Mean   :0.2193   Mean   :0.2093  
##  3rd Qu.:2013-09-10   3rd Qu.:0.33553   3rd Qu.:0.2958   3rd Qu.:0.2858  
##  Max.   :2014-03-10   Max.   :0.42395   Max.   :0.3600   Max.   :0.3400  
##                                                                          
##  EstimatedEffectiveYield EstimatedLoss     EstimatedReturn  
##  Min.   :-0.1827         Min.   :0.00490   Min.   :-0.1827  
##  1st Qu.: 0.1106         1st Qu.:0.05200   1st Qu.: 0.0780  
##  Median : 0.1715         Median :0.09800   Median : 0.1144  
##  Mean   : 0.1762         Mean   :0.09379   Mean   : 0.1075  
##  3rd Qu.: 0.2469         3rd Qu.:0.14050   3rd Qu.: 0.1363  
##  Max.   : 0.3199         Max.   :0.36600   Max.   : 0.2837  
##  NA's   :131             NA's   :131       NA's   :131      
##  ProsperRating.num ProsperRating.alpha  ProsperScore   
##  Min.   :1.000     D      :5869        Min.   : 1.000  
##  1st Qu.:2.000     E      :3830        1st Qu.: 5.000  
##  Median :3.000     C      :3817        Median : 6.000  
##  Mean   :3.663     HR     :3725        Mean   : 6.266  
##  3rd Qu.:5.000     A      :3608        3rd Qu.: 8.000  
##  Max.   :7.000     (Other):5156        Max.   :11.000  
##  NA's   :131       NA's   : 131        NA's   :131     
##  ListingCategory.num BorrowerState                      Occupation   
##  Min.   : 0.00       CA     : 3325   Other                   : 6786  
##  1st Qu.: 1.00       FL     : 1768   Professional            : 3452  
##  Median : 2.00       NY     : 1639   Computer Programmer     : 1261  
##  Mean   : 3.75       TX     : 1562   Administrative Assistant:  959  
##  3rd Qu.: 7.00       IL     : 1389   Executive               :  950  
##  Max.   :20.00       GA     : 1127   (Other)                 :12715  
##                      (Other):15326   NA's                    :   13  
##       EmploymentStatus EmploymentStatusDuration IsBorrowerHomeowner
##  Employed     :16491   Min.   :  0.00           Mode :logical      
##  Full-time    : 6634   1st Qu.: 27.00           FALSE:12814        
##  Self-employed: 1334   Median : 63.00           TRUE :13322        
##  Other        :  798   Mean   : 91.06                              
##  Not employed :  375   3rd Qu.:127.00                              
##  Retired      :  273   Max.   :755.00                              
##  (Other)      :  231   NA's   :9                                   
##  CurrentlyInGroup                    GroupKey     DateCreditPulled    
##  Mode :logical    3D4D3366260257624AB272D:  201   Min.   :2009-07-13  
##  FALSE:24741      783C3371218786870A73D20:  134   1st Qu.:2010-10-13  
##  TRUE :1395       52EA3425051368132B80C96:  109   Median :2011-09-29  
##                   B0473364376920128370B13:   63   Mean   :2011-08-21  
##                   FEF83377364176536637E50:   54   3rd Qu.:2012-06-14  
##                   (Other)                :  817   Max.   :2014-02-13  
##                   NA's                   :24758                       
##  CreditScoreRangeLower CreditScoreRangeUpper FirstRecordedCreditLine
##  Min.   :600.0         Min.   :619.0         Min.   :1953-09-01     
##  1st Qu.:660.0         1st Qu.:679.0         1st Qu.:1990-12-03     
##  Median :700.0         Median :719.0         Median :1996-04-16     
##  Mean   :701.7         Mean   :720.7         Mean   :1995-04-06     
##  3rd Qu.:740.0         3rd Qu.:759.0         3rd Qu.:2000-05-19     
##  Max.   :880.0         Max.   :899.0         Max.   :2012-06-19     
##                                                                     
##  CurrentCreditLines OpenCreditLines  TotalCreditLinespast7years
##  Min.   : 0.000     Min.   : 0.000   Min.   :  2.0             
##  1st Qu.: 6.000     1st Qu.: 5.000   1st Qu.: 16.0             
##  Median : 9.000     Median : 8.000   Median : 25.0             
##  Mean   : 9.576     Mean   : 8.454   Mean   : 26.6             
##  3rd Qu.:13.000     3rd Qu.:11.000   3rd Qu.: 35.0             
##  Max.   :59.000     Max.   :48.000   Max.   :124.0             
##                                                                
##  OpenRevolvingAccounts OpenRevolvingMonthlyPayment InquiriesLast6Months
##  Min.   : 0.000        Min.   :   0.0              Min.   : 0.000      
##  1st Qu.: 3.000        1st Qu.:  97.0              1st Qu.: 0.000      
##  Median : 6.000        Median : 231.0              Median : 1.000      
##  Mean   : 6.442        Mean   : 349.2              Mean   : 1.188      
##  3rd Qu.: 9.000        3rd Qu.: 457.0              3rd Qu.: 2.000      
##  Max.   :47.000        Max.   :5720.0              Max.   :27.000      
##                                                                        
##  TotalInquiries   CurrentDelinquencies AmountDelinquent  
##  Min.   : 0.000   Min.   : 0.0000      Min.   :     0.0  
##  1st Qu.: 2.000   1st Qu.: 0.0000      1st Qu.:     0.0  
##  Median : 4.000   Median : 0.0000      Median :     0.0  
##  Mean   : 4.646   Mean   : 0.3694      Mean   :   992.6  
##  3rd Qu.: 6.000   3rd Qu.: 0.0000      3rd Qu.:     0.0  
##  Max.   :74.000   Max.   :32.0000      Max.   :327677.0  
##                                                          
##  DelinquenciesLast7Years PublicRecordsLast10Years
##  Min.   : 0.000          Min.   : 0.0000         
##  1st Qu.: 0.000          1st Qu.: 0.0000         
##  Median : 0.000          Median : 0.0000         
##  Mean   : 3.401          Mean   : 0.2609         
##  3rd Qu.: 2.000          3rd Qu.: 0.0000         
##  Max.   :99.000          Max.   :12.0000         
##                                                  
##  PublicRecordsLast12Months RevolvingCreditBalance BankcardUtilization
##  Min.   :0.00000           Min.   :     0         Min.   :0.0000     
##  1st Qu.:0.00000           1st Qu.:  2071         1st Qu.:0.2200     
##  Median :0.00000           Median :  6798         Median :0.5400     
##  Mean   :0.01144           Mean   : 15210         Mean   :0.5141     
##  3rd Qu.:0.00000           3rd Qu.: 16600         3rd Qu.:0.8100     
##  Max.   :4.00000           Max.   :879785         Max.   :2.5000     
##                                                                      
##  AvailableBankcardCredit  TotalTrades     TradesNeverDelinquent.per
##  Min.   :     0.0        Min.   :  1.00   Min.   :0.1600           
##  1st Qu.:   850.8        1st Qu.: 14.00   1st Qu.:0.8300           
##  Median :  4198.0        Median : 21.00   Median :0.9500           
##  Mean   : 11174.3        Mean   : 22.87   Mean   :0.8973           
##  3rd Qu.: 13414.0        3rd Qu.: 30.00   3rd Qu.:1.0000           
##  Max.   :412785.0        Max.   :122.00   Max.   :1.0000           
##                                                                    
##  TradesOpenedLast6Months DebtToIncomeRatio         IncomeRange  
##  Min.   : 0.0000         Min.   : 0.0000   $25,000-49,999:8367  
##  1st Qu.: 0.0000         1st Qu.: 0.1300   $50,000-74,999:7411  
##  Median : 0.0000         Median : 0.2000   $75,000-99,999:4041  
##  Mean   : 0.7603         Mean   : 0.2488   $100,000+     :3948  
##  3rd Qu.: 1.0000         3rd Qu.: 0.3000   $1-24,999     :1964  
##  Max.   :20.0000         Max.   :10.0100   Not employed  : 375  
##                          NA's   :2983      (Other)       :  30  
##  IncomeVerifiable StatedMonthlyIncome                    LoanKey     
##  Mode :logical    Min.   :     0      08C43696561586194AC381C:    2  
##  FALSE:2976       1st Qu.:  3167      09303699897852595CD59DD:    2  
##  TRUE :23160      Median :  4583      114D37056655628721BD6C8:    2  
##                   Mean   :  5488      156836977849742636AE34F:    2  
##                   3rd Qu.:  6667      56D73700259224545E36FBC:    2  
##                   Max.   :618548      63113695530739927C7EA06:    2  
##                                       (Other)                :26124  
##  TotalProsperLoans TotalProsperPaymentsBilled OnTimeProsperPayments
##  Min.   :0.000     Min.   :  0.00             Min.   :  0.00       
##  1st Qu.:1.000     1st Qu.:  9.00             1st Qu.:  9.00       
##  Median :1.000     Median : 18.00             Median : 18.00       
##  Mean   :1.401     Mean   : 22.57             Mean   : 21.88       
##  3rd Qu.:2.000     3rd Qu.: 33.00             3rd Qu.: 32.00       
##  Max.   :7.000     Max.   :120.00             Max.   :114.00       
##  NA's   :17826     NA's   :17826              NA's   :17826        
##  ProsperPaymentsLessThanOneMonthLate ProsperPaymentsOneMonthPlusLate
##  Min.   : 0.000                      Min.   : 0.000                 
##  1st Qu.: 0.000                      1st Qu.: 0.000                 
##  Median : 0.000                      Median : 0.000                 
##  Mean   : 0.635                      Mean   : 0.058                 
##  3rd Qu.: 0.000                      3rd Qu.: 0.000                 
##  Max.   :42.000                      Max.   :21.000                 
##  NA's   :17826                       NA's   :17826                  
##  ProsperPrincipalBorrowed ProsperPrincipalOutstanding
##  Min.   :    0            Min.   :    0.0            
##  1st Qu.: 3000            1st Qu.:    0.0            
##  Median : 5000            Median :  824.7            
##  Mean   : 7394            Mean   : 2127.9            
##  3rd Qu.:10000            3rd Qu.: 3179.1            
##  Max.   :60001            Max.   :22586.7            
##  NA's   :17826            NA's   :17826              
##  ScorexChangeAtTimeOfListing LoanCurrentDaysDelinquent
##  Min.   :-194.00             Min.   :   0.0           
##  1st Qu.: -32.00             1st Qu.:   0.0           
##  Median :  -3.00             Median :   0.0           
##  Mean   :  -0.29             Mean   : 115.9           
##  3rd Qu.:  29.00             3rd Qu.:   0.0           
##  Max.   : 286.00             Max.   :1593.0           
##  NA's   :17923                                        
##  LoanFirstDefaultedCycleNumber LoanMonthsSinceOrigination   LoanNumber    
##  Min.   : 1.00                 Min.   : 1.00              Min.   : 38045  
##  1st Qu.: 9.00                 1st Qu.:21.00              1st Qu.: 45089  
##  Median :13.00                 Median :29.00              Median : 54430  
##  Mean   :14.49                 Mean   :30.47              Mean   : 58559  
##  3rd Qu.:19.00                 3rd Qu.:41.00              3rd Qu.: 68482  
##  Max.   :41.00                 Max.   :56.00              Max.   :132453  
##  NA's   :19891                                                            
##  LoanOriginalAmount LoanOriginationDate  LoanOriginationQuarter
##  Min.   : 1000      Min.   :2009-07-20   Q4 2011: 2352         
##  1st Qu.: 3000      1st Qu.:2010-10-29   Q2 2012: 2272         
##  Median : 4500      Median :2011-10-12   Q1 2012: 2252         
##  Mean   : 6365      Mean   :2011-09-03   Q3 2012: 2213         
##  3rd Qu.: 8000      3rd Qu.:2012-06-25   Q3 2011: 2018         
##  Max.   :35000      Max.   :2014-02-21   Q2 2011: 1713         
##                                          (Other):13316         
##                    MemberKey     MonthlyLoanPayment LP_CustomerPayments
##  C70934206057523078260C7:    7   Min.   :   0.0     Min.   :   -2.35   
##  E4AF3422677498955FFA00E:    7   1st Qu.: 121.6     1st Qu.: 2304.53   
##  720D3508651090808DC328F:    6   Median : 175.9     Median : 4561.31   
##  D65B3496915385104F50CD7:    6   Mean   : 232.2     Mean   : 6193.82   
##  E48334334509567416C8C65:    6   3rd Qu.: 314.4     3rd Qu.: 8501.98   
##  43DB3366978035224D7D9E3:    5   Max.   :2251.5     Max.   :37369.16   
##  (Other)                :26099                                         
##  LP_CustomerPrincipalPayments LP_InterestandFees LP_ServiceFees   
##  Min.   :    0                Min.   :   -2.35   Min.   :-589.95  
##  1st Qu.: 1795                1st Qu.:  326.71   1st Qu.: -70.74  
##  Median : 4000                Median :  746.15   Median : -35.07  
##  Mean   : 5128                Mean   : 1065.72   Mean   : -52.18  
##  3rd Qu.: 7000                3rd Qu.: 1487.20   3rd Qu.: -16.07  
##  Max.   :35000                Max.   :10013.57   Max.   :   3.01  
##                                                                   
##  LP_CollectionFees  LP_GrossPrincipalLoss LP_NetPrincipalLoss
##  Min.   :-4865.08   Min.   :  -94.2       Min.   : -504.4    
##  1st Qu.:    0.00   1st Qu.:    0.0       1st Qu.:    0.0    
##  Median :    0.00   Median :    0.0       Median :    0.0    
##  Mean   :  -17.25   Mean   : 1221.7       Mean   : 1194.6    
##  3rd Qu.:    0.00   3rd Qu.:    0.0       3rd Qu.:    0.0    
##  Max.   :    0.00   Max.   :25000.0       Max.   :25000.0    
##                                                              
##  LP_NonPrincipalRecoverypayments PercentFunded   Recommendations   
##  Min.   :   0.00                 Min.   :0.700   Min.   : 0.00000  
##  1st Qu.:   0.00                 1st Qu.:1.000   1st Qu.: 0.00000  
##  Median :   0.00                 Median :1.000   Median : 0.00000  
##  Mean   :  24.83                 Mean   :0.997   Mean   : 0.03646  
##  3rd Qu.:   0.00                 3rd Qu.:1.000   3rd Qu.: 0.00000  
##  Max.   :7780.03                 Max.   :1.000   Max.   :18.00000  
##                                                                    
##  InvestmentFromFriendsCount InvestmentFromFriendsAmount   Investors      
##  Min.   :0.00000            Min.   :    0.00            Min.   :   1.00  
##  1st Qu.:0.00000            1st Qu.:    0.00            1st Qu.:  28.00  
##  Median :0.00000            Median :    0.00            Median :  62.00  
##  Mean   :0.02124            Mean   :   12.94            Mean   :  92.67  
##  3rd Qu.:0.00000            3rd Qu.:    0.00            3rd Qu.: 125.00  
##  Max.   :9.00000            Max.   :11000.00            Max.   :1189.00  
## 

Here, I see that at least one loan prior to 2009 has no credit grade.

summary(filter(data, !is.na(ClosedDate) & is.na(CreditGrade) & ListingCreationDate < "2009-07-01"))
##                    ListingKey  ListingNumber    ListingCreationDate 
##  0385345033494662260733C:  1   Min.   :149172   Min.   :2007-06-08  
##  04D73431953660481B1EC1D:  1   1st Qu.:306608   1st Qu.:2008-04-08  
##  04F334232790941784498F1:  1   Median :339464   Median :2008-05-26  
##  05153419481232978723A5F:  1   Mean   :341138   Mean   :2008-06-24  
##  059934165217732065237C5:  1   3rd Qu.:397924   3rd Qu.:2008-09-13  
##  06FF342963152332574DF05:  1   Max.   :415961   Max.   :2009-05-06  
##  (Other)                :125                                        
##   CreditGrade       Term                        LoanStatus 
##  NC     :  0   Min.   :12.00   Completed             :122  
##  HR     :  0   1st Qu.:36.00   Chargedoff            :  6  
##  E      :  0   Median :36.00   Defaulted             :  3  
##  D      :  0   Mean   :35.82   Cancelled             :  0  
##  C      :  0   3rd Qu.:36.00   Current               :  0  
##  (Other):  0   Max.   :36.00   FinalPaymentInProgress:  0  
##  NA's   :131                   (Other)               :  0  
##    ClosedDate          BorrowerAPR       BorrowerRate    
##  Min.   :2010-01-28   Min.   :0.06207   Min.   :0.05870  
##  1st Qu.:2011-04-21   1st Qu.:0.11271   1st Qu.:0.09025  
##  Median :2012-04-05   Median :0.17018   Median :0.14000  
##  Mean   :2012-02-01   Mean   :0.18688   Mean   :0.16300  
##  3rd Qu.:2012-10-29   3rd Qu.:0.25811   3rd Qu.:0.22700  
##  Max.   :2013-10-12   Max.   :0.39460   Max.   :0.35300  
##                                                          
##   LenderYield      EstimatedEffectiveYield EstimatedLoss EstimatedReturn
##  Min.   :0.04870   Min.   : NA             Min.   : NA   Min.   : NA    
##  1st Qu.:0.08025   1st Qu.: NA             1st Qu.: NA   1st Qu.: NA    
##  Median :0.13000   Median : NA             Median : NA   Median : NA    
##  Mean   :0.15293   Mean   :NaN             Mean   :NaN   Mean   :NaN    
##  3rd Qu.:0.21700   3rd Qu.: NA             3rd Qu.: NA   3rd Qu.: NA    
##  Max.   :0.34000   Max.   : NA             Max.   : NA   Max.   : NA    
##                    NA's   :131             NA's   :131   NA's   :131    
##  ProsperRating.num ProsperRating.alpha  ProsperScore ListingCategory.num
##  Min.   : NA       NC     :  0         Min.   : NA   Min.   :1.000      
##  1st Qu.: NA       HR     :  0         1st Qu.: NA   1st Qu.:1.000      
##  Median : NA       E      :  0         Median : NA   Median :1.000      
##  Mean   :NaN       D      :  0         Mean   :NaN   Mean   :2.893      
##  3rd Qu.: NA       C      :  0         3rd Qu.: NA   3rd Qu.:5.000      
##  Max.   : NA       (Other):  0         Max.   : NA   Max.   :7.000      
##  NA's   :131       NA's   :131         NA's   :131                      
##  BorrowerState                    Occupation      EmploymentStatus
##  CA     :18    Other                   :30   Full-time    :104    
##  TX     :18    Professional            :23   Employed     : 12    
##  NY     : 9    Analyst                 : 9   Part-time    :  7    
##  IL     : 7    Computer Programmer     : 9   Retired      :  4    
##  CT     : 6    Administrative Assistant: 5   Self-employed:  4    
##  MN     : 6    Teacher                 : 5   Not available:  0    
##  (Other):67    (Other)                 :50   (Other)      :  0    
##  EmploymentStatusDuration IsBorrowerHomeowner CurrentlyInGroup
##  Min.   :  0.00           Mode :logical       Mode :logical   
##  1st Qu.: 26.00           FALSE:66            FALSE:107       
##  Median : 50.00           TRUE :65            TRUE :24        
##  Mean   : 74.24                                               
##  3rd Qu.:105.00                                               
##  Max.   :472.00                                               
##                                                               
##                     GroupKey   DateCreditPulled     CreditScoreRangeLower
##  783C3371218786870A73D20:  5   Min.   :2009-07-13   Min.   :600.0        
##  020E3366126106360DB9421:  1   1st Qu.:2009-10-19   1st Qu.:660.0        
##  17693364417023401A53169:  1   Median :2010-02-03   Median :720.0        
##  18DA336463918236939DCE7:  1   Mean   :2010-02-23   Mean   :711.1        
##  3D4D3366260257624AB272D:  1   3rd Qu.:2010-07-02   3rd Qu.:740.0        
##  (Other)                : 15   Max.   :2010-12-19   Max.   :860.0        
##  NA's                   :107                                             
##  CreditScoreRangeUpper FirstRecordedCreditLine CurrentCreditLines
##  Min.   :619.0         Min.   :1959-10-01      Min.   : 1.00     
##  1st Qu.:679.0         1st Qu.:1992-12-11      1st Qu.: 7.00     
##  Median :739.0         Median :1996-08-28      Median : 9.00     
##  Mean   :730.1         Mean   :1995-06-17      Mean   :10.27     
##  3rd Qu.:759.0         3rd Qu.:2000-04-07      3rd Qu.:13.00     
##  Max.   :879.0         Max.   :2007-09-10      Max.   :35.00     
##                                                                  
##  OpenCreditLines  TotalCreditLinespast7years OpenRevolvingAccounts
##  Min.   : 1.000   Min.   : 4.00              Min.   : 0.000       
##  1st Qu.: 5.000   1st Qu.:17.00              1st Qu.: 4.000       
##  Median : 8.000   Median :22.00              Median : 6.000       
##  Mean   : 8.832   Mean   :25.51              Mean   : 6.855       
##  3rd Qu.:12.000   3rd Qu.:33.00              3rd Qu.: 9.000       
##  Max.   :29.000   Max.   :58.00              Max.   :29.000       
##                                                                   
##  OpenRevolvingMonthlyPayment InquiriesLast6Months TotalInquiries  
##  Min.   :   0.0              Min.   :0.000        Min.   : 0.000  
##  1st Qu.:  90.5              1st Qu.:0.000        1st Qu.: 2.000  
##  Median : 239.0              Median :0.000        Median : 4.000  
##  Mean   : 309.1              Mean   :0.855        Mean   : 5.191  
##  3rd Qu.: 420.0              3rd Qu.:1.000        3rd Qu.: 8.000  
##  Max.   :1956.0              Max.   :9.000        Max.   :19.000  
##                                                                   
##  CurrentDelinquencies AmountDelinquent  DelinquenciesLast7Years
##  Min.   :0.0000       Min.   :    0.0   Min.   : 0.000         
##  1st Qu.:0.0000       1st Qu.:    0.0   1st Qu.: 0.000         
##  Median :0.0000       Median :    0.0   Median : 0.000         
##  Mean   :0.2824       Mean   :  433.7   Mean   : 2.718         
##  3rd Qu.:0.0000       3rd Qu.:    0.0   3rd Qu.: 0.000         
##  Max.   :8.0000       Max.   :31919.0   Max.   :43.000         
##                                                                
##  PublicRecordsLast10Years PublicRecordsLast12Months RevolvingCreditBalance
##  Min.   :0.0000           Min.   :0                 Min.   :    0         
##  1st Qu.:0.0000           1st Qu.:0                 1st Qu.: 2308         
##  Median :0.0000           Median :0                 Median : 8074         
##  Mean   :0.1756           Mean   :0                 Mean   :12039         
##  3rd Qu.:0.0000           3rd Qu.:0                 3rd Qu.:16422         
##  Max.   :3.0000           Max.   :0                 Max.   :97290         
##                                                                           
##  BankcardUtilization AvailableBankcardCredit  TotalTrades   
##  Min.   :0.0000      Min.   :     0          Min.   : 3.00  
##  1st Qu.:0.1800      1st Qu.:  1557          1st Qu.:14.50  
##  Median :0.4400      Median :  6999          Median :19.00  
##  Mean   :0.4524      Mean   : 13522          Mean   :22.21  
##  3rd Qu.:0.7200      3rd Qu.: 17470          3rd Qu.:29.00  
##  Max.   :0.9900      Max.   :110117          Max.   :52.00  
##                                                             
##  TradesNeverDelinquent.per TradesOpenedLast6Months DebtToIncomeRatio
##  Min.   :0.3000            Min.   :0.0000          Min.   :0.0200   
##  1st Qu.:0.8400            1st Qu.:0.0000          1st Qu.:0.1100   
##  Median :0.9600            Median :0.0000          Median :0.2000   
##  Mean   :0.8996            Mean   :0.5725          Mean   :0.2500   
##  3rd Qu.:1.0000            3rd Qu.:1.0000          3rd Qu.:0.2725   
##  Max.   :1.0000            Max.   :5.0000          Max.   :5.5900   
##                                                    NA's   :11       
##          IncomeRange IncomeVerifiable StatedMonthlyIncome
##  $50,000-74,999:45   Mode :logical    Min.   :  212.8    
##  $25,000-49,999:40   FALSE:11         1st Qu.: 3333.3    
##  $75,000-99,999:17   TRUE :120        Median : 4616.7    
##  $100,000+     :16                    Mean   : 5111.2    
##  $1-24,999     :13                    3rd Qu.: 6375.0    
##  Not displayed : 0                    Max.   :20833.3    
##  (Other)       : 0                                       
##                     LoanKey    TotalProsperLoans
##  003C35735230494626ADB02:  1   Min.   :1.000    
##  02CA35638190585257E0D22:  1   1st Qu.:1.000    
##  030B35936026115966F4EA0:  1   Median :1.000    
##  032A357638786716375DFFB:  1   Mean   :1.153    
##  040235782802629332A0C8C:  1   3rd Qu.:1.000    
##  05BC35722810324548A02FE:  1   Max.   :3.000    
##  (Other)                :125   NA's   :72       
##  TotalProsperPaymentsBilled OnTimeProsperPayments
##  Min.   : 1.00              Min.   : 0.00        
##  1st Qu.:14.50              1st Qu.:14.50        
##  Median :24.00              Median :22.00        
##  Mean   :22.76              Mean   :22.54        
##  3rd Qu.:34.00              3rd Qu.:33.50        
##  Max.   :42.00              Max.   :41.00        
##  NA's   :72                 NA's   :72           
##  ProsperPaymentsLessThanOneMonthLate ProsperPaymentsOneMonthPlusLate
##  Min.   :0.0000                      Min.   :0                      
##  1st Qu.:0.0000                      1st Qu.:0                      
##  Median :0.0000                      Median :0                      
##  Mean   :0.2203                      Mean   :0                      
##  3rd Qu.:0.0000                      3rd Qu.:0                      
##  Max.   :3.0000                      Max.   :0                      
##  NA's   :72                          NA's   :72                     
##  ProsperPrincipalBorrowed ProsperPrincipalOutstanding
##  Min.   : 1000            Min.   :   0.00            
##  1st Qu.: 1775            1st Qu.:   0.00            
##  Median : 4500            Median :   0.00            
##  Mean   : 5491            Mean   : 428.24            
##  3rd Qu.: 7500            3rd Qu.:   0.25            
##  Max.   :27000            Max.   :5788.52            
##  NA's   :72               NA's   :72                 
##  ScorexChangeAtTimeOfListing LoanCurrentDaysDelinquent
##  Min.   :-50.00              Min.   :   0.00          
##  1st Qu.: -7.00              1st Qu.:   0.00          
##  Median : 39.00              Median :   0.00          
##  Mean   : 43.37              Mean   :  53.65          
##  3rd Qu.: 83.00              3rd Qu.:   0.00          
##  Max.   :215.00              Max.   :1257.00          
##  NA's   :74                                           
##  LoanFirstDefaultedCycleNumber LoanMonthsSinceOrigination   LoanNumber   
##  Min.   :10.00                 Min.   :39.00              Min.   :38046  
##  1st Qu.:18.00                 1st Qu.:44.00              1st Qu.:39344  
##  Median :23.00                 Median :49.00              Median :40869  
##  Mean   :24.22                 Mean   :48.34              Mean   :41386  
##  3rd Qu.:32.00                 3rd Qu.:52.00              3rd Qu.:43474  
##  Max.   :37.00                 Max.   :56.00              Max.   :46378  
##  NA's   :122                                                             
##  LoanOriginalAmount LoanOriginationDate  LoanOriginationQuarter
##  Min.   : 1000      Min.   :2009-07-22   Q4 2009:32            
##  1st Qu.: 2000      1st Qu.:2009-11-08   Q3 2009:26            
##  Median : 3000      Median :2010-02-17   Q2 2010:21            
##  Mean   : 4187      Mean   :2010-03-11   Q4 2010:21            
##  3rd Qu.: 5000      3rd Qu.:2010-07-18   Q1 2010:17            
##  Max.   :15000      Max.   :2010-12-30   Q3 2010:14            
##                                          (Other): 0            
##                    MemberKey   MonthlyLoanPayment LP_CustomerPayments
##  010B33941340101099BFE47:  1   Min.   :  0.00     Min.   :  458.2    
##  016533808792025682035EE:  1   1st Qu.: 63.24     1st Qu.: 2161.4    
##  0CCD3420393708396FB7287:  1   Median :111.95     Median : 3865.5    
##  0F1733815422230679CFC01:  1   Mean   :146.00     Mean   : 4865.0    
##  0F5133834635103374519DF:  1   3rd Qu.:188.66     3rd Qu.: 6402.7    
##  10D73380714543112C251DF:  1   Max.   :578.69     Max.   :18748.2    
##  (Other)                :125                                         
##  LP_CustomerPrincipalPayments LP_InterestandFees LP_ServiceFees   
##  Min.   :  204.8              Min.   :  11.26    Min.   :-242.93  
##  1st Qu.: 1946.1              1st Qu.: 254.88    1st Qu.: -62.53  
##  Median : 3000.0              Median : 546.00    Median : -38.67  
##  Mean   : 4043.8              Mean   : 821.17    Mean   : -50.11  
##  3rd Qu.: 5000.0              3rd Qu.:1143.52    3rd Qu.: -19.86  
##  Max.   :15000.0              Max.   :3748.19    Max.   :  -1.41  
##                                                                   
##  LP_CollectionFees LP_GrossPrincipalLoss LP_NetPrincipalLoss
##  Min.   :0         Min.   :   0.0        Min.   :   0.0     
##  1st Qu.:0         1st Qu.:   0.0        1st Qu.:   0.0     
##  Median :0         Median :   0.0        Median :   0.0     
##  Mean   :0         Mean   : 145.4        Mean   : 145.4     
##  3rd Qu.:0         3rd Qu.:   0.0        3rd Qu.:   0.0     
##  Max.   :0         Max.   :8911.2        Max.   :8911.2     
##                                                             
##  LP_NonPrincipalRecoverypayments PercentFunded Recommendations  
##  Min.   :0                       Min.   :1     Min.   :0.00000  
##  1st Qu.:0                       1st Qu.:1     1st Qu.:0.00000  
##  Median :0                       Median :1     Median :0.00000  
##  Mean   :0                       Mean   :1     Mean   :0.08397  
##  3rd Qu.:0                       3rd Qu.:1     3rd Qu.:0.00000  
##  Max.   :0                       Max.   :1     Max.   :2.00000  
##                                                                 
##  InvestmentFromFriendsCount InvestmentFromFriendsAmount   Investors    
##  Min.   :0.00000            Min.   :   0.00             Min.   : 10.0  
##  1st Qu.:0.00000            1st Qu.:   0.00             1st Qu.: 75.5  
##  Median :0.00000            Median :   0.00             Median :124.0  
##  Mean   :0.03817            Mean   :  57.97             Mean   :155.5  
##  3rd Qu.:0.00000            3rd Qu.:   0.00             3rd Qu.:204.0  
##  Max.   :1.00000            Max.   :5140.00             Max.   :594.0  
## 

I see that 130 loans are missing a credit grade for no apparent reason. I don’t see any pattern here, and assume that it is impossible right now for me to tell why this data is missing. However, this is a relatively small amount of data.

I am otherwise assuming that CreditGrade was effectively replaced by ProsperScore in 2009, and that these can be used more-or-less interchangeably, particularly given that their labels correspond.

Next, I notice that only about half of the closed loans have estimated effective lender yields or several other estimates of yield/loss, although they are not closed. I assume these are pre-July 2009 listings, but I want to take a closer look at them.

summary(filter(data, !is.na(ClosedDate) & is.na(EstimatedEffectiveYield)))
##                    ListingKey    ListingNumber    ListingCreationDate 
##  00033425227988088FA6752:    1   Min.   :     4   Min.   :2005-11-09  
##  000433785890431972B4743:    1   1st Qu.: 92588   1st Qu.:2007-02-02  
##  00083422661625108817246:    1   Median :199844   Median :2007-09-10  
##  000A34209897973969CFA81:    1   Mean   :201960   Mean   :2007-08-26  
##  000D3410451511356B08F17:    1   3rd Qu.:314319   3rd Qu.:2008-04-19  
##  00143395229257559A91663:    1   Max.   :415961   Max.   :2009-05-06  
##  (Other)                :29078                                        
##   CreditGrade        Term                     LoanStatus   
##  C      :5649   Min.   :12   Completed             :18410  
##  D      :5153   1st Qu.:36   Chargedoff            : 6656  
##  B      :4389   Median :36   Defaulted             : 4013  
##  AA     :3509   Mean   :36   Cancelled             :    5  
##  HR     :3508   3rd Qu.:36   Current               :    0  
##  (Other):6745   Max.   :36   FinalPaymentInProgress:    0  
##  NA's   : 131                (Other)               :    0  
##    ClosedDate          BorrowerAPR       BorrowerRate     LenderYield     
##  Min.   :2005-11-25   Min.   :0.00653   Min.   :0.0000   Min.   :-0.0100  
##  1st Qu.:2008-08-25   1st Qu.:0.13705   1st Qu.:0.1269   1st Qu.: 0.1170  
##  Median :2009-08-17   Median :0.18224   Median :0.1700   Median : 0.1600  
##  Mean   :2009-07-30   Mean   :0.19596   Mean   :0.1833   Mean   : 0.1730  
##  3rd Qu.:2010-07-29   3rd Qu.:0.24753   3rd Qu.:0.2364   3rd Qu.: 0.2224  
##  Max.   :2013-10-12   Max.   :0.51229   Max.   :0.4975   Max.   : 0.4925  
##                       NA's   :25                                          
##  EstimatedEffectiveYield EstimatedLoss   EstimatedReturn ProsperRating.num
##  Min.   : NA             Min.   : NA     Min.   : NA     Min.   : NA      
##  1st Qu.: NA             1st Qu.: NA     1st Qu.: NA     1st Qu.: NA      
##  Median : NA             Median : NA     Median : NA     Median : NA      
##  Mean   :NaN             Mean   :NaN     Mean   :NaN     Mean   :NaN      
##  3rd Qu.: NA             3rd Qu.: NA     3rd Qu.: NA     3rd Qu.: NA      
##  Max.   : NA             Max.   : NA     Max.   : NA     Max.   : NA      
##  NA's   :29084           NA's   :29084   NA's   :29084   NA's   :29084    
##  ProsperRating.alpha  ProsperScore   ListingCategory.num BorrowerState  
##  NC     :    0       Min.   : NA     Min.   :0.000       CA     : 3956  
##  HR     :    0       1st Qu.: NA     1st Qu.:0.000       GA     : 1661  
##  E      :    0       Median : NA     Median :0.000       IL     : 1657  
##  D      :    0       Mean   :NaN     Mean   :1.203       FL     : 1314  
##  C      :    0       3rd Qu.: NA     3rd Qu.:1.000       TX     : 1208  
##  (Other):    0       Max.   : NA     Max.   :7.000       (Other):13773  
##  NA's   :29084       NA's   :29084                       NA's   : 5515  
##                Occupation         EmploymentStatus
##  Other              : 7300   Full-time    :18428  
##  Professional       : 3086   Not available: 5347  
##  Computer Programmer: 1242   Self-employed: 1596  
##  Sales - Commission : 1096   Part-time    :  832  
##  Clerical           : 1048   Retired      :  428  
##  (Other)            :13057   (Other)      :  198  
##  NA's               : 2255   NA's         : 2255  
##  EmploymentStatusDuration IsBorrowerHomeowner CurrentlyInGroup
##  Min.   :  0.00           Mode :logical       Mode :logical   
##  1st Qu.: 15.00           FALSE:16454         FALSE:18611     
##  Median : 40.00           TRUE :12630         TRUE :10473     
##  Mean   : 68.49                                               
##  3rd Qu.: 94.00                                               
##  Max.   :623.00                                               
##  NA's   :7606                                                 
##                     GroupKey     DateCreditPulled    
##  783C3371218786870A73D20:  932   Min.   :2005-11-09  
##  6A3B336601725506917317E:  619   1st Qu.:2007-01-30  
##  3D4D3366260257624AB272D:  606   Median :2007-09-04  
##  FEF83377364176536637E50:  529   Mean   :2007-08-24  
##  C9643379247860156A00EC0:  342   3rd Qu.:2008-04-17  
##  (Other)                : 8287   Max.   :2010-12-19  
##  NA's                   :17769                       
##  CreditScoreRangeLower CreditScoreRangeUpper FirstRecordedCreditLine
##  Min.   :  0.0         Min.   : 19.0         Min.   :1947-08-24     
##  1st Qu.:600.0         1st Qu.:619.0         1st Qu.:1990-07-26     
##  Median :640.0         Median :659.0         Median :1995-06-01     
##  Mean   :644.4         Mean   :663.4         Mean   :1994-08-07     
##  3rd Qu.:700.0         3rd Qu.:719.0         3rd Qu.:1999-08-31     
##  Max.   :880.0         Max.   :899.0         Max.   :2008-07-01     
##  NA's   :591           NA's   :591           NA's   :697            
##  CurrentCreditLines OpenCreditLines TotalCreditLinespast7years
##  Min.   : 0.000     Min.   : 0.0    Min.   :  2.00            
##  1st Qu.: 5.000     1st Qu.: 4.0    1st Qu.: 13.00            
##  Median : 9.000     Median : 7.0    Median : 22.00            
##  Mean   : 9.563     Mean   : 8.2    Mean   : 24.06            
##  3rd Qu.:13.000     3rd Qu.:11.0    3rd Qu.: 32.00            
##  Max.   :52.000     Max.   :51.0    Max.   :136.00            
##  NA's   :7604       NA's   :7604    NA's   :697               
##  OpenRevolvingAccounts OpenRevolvingMonthlyPayment InquiriesLast6Months
##  Min.   : 0.000        Min.   :    0.0             Min.   :  0.000     
##  1st Qu.: 2.000        1st Qu.:   35.0             1st Qu.:  0.000     
##  Median : 5.000        Median :  139.0             Median :  2.000     
##  Mean   : 5.755        Mean   :  303.7             Mean   :  2.841     
##  3rd Qu.: 8.000        3rd Qu.:  374.0             3rd Qu.:  4.000     
##  Max.   :51.000        Max.   :14985.0             Max.   :105.000     
##                                                    NA's   :697         
##  TotalInquiries    CurrentDelinquencies AmountDelinquent
##  Min.   :  0.000   Min.   : 0.000       Min.   :     0  
##  1st Qu.:  3.000   1st Qu.: 0.000       1st Qu.:     0  
##  Median :  7.000   Median : 0.000       Median :     0  
##  Mean   :  9.516   Mean   : 1.398       Mean   :  1118  
##  3rd Qu.: 13.000   3rd Qu.: 1.000       3rd Qu.:    30  
##  Max.   :379.000   Max.   :83.000       Max.   :444745  
##  NA's   :1159      NA's   :697          NA's   :7622    
##  DelinquenciesLast7Years PublicRecordsLast10Years
##  Min.   : 0.000          Min.   : 0.0000         
##  1st Qu.: 0.000          1st Qu.: 0.0000         
##  Median : 0.000          Median : 0.0000         
##  Mean   : 5.652          Mean   : 0.3949         
##  3rd Qu.: 6.000          3rd Qu.: 1.0000         
##  Max.   :99.000          Max.   :30.0000         
##  NA's   :990             NA's   :697             
##  PublicRecordsLast12Months RevolvingCreditBalance BankcardUtilization
##  Min.   :0.000             Min.   :      0        Min.   :0.00       
##  1st Qu.:0.000             1st Qu.:   1192        1st Qu.:0.20       
##  Median :0.000             Median :   5206        Median :0.60       
##  Mean   :0.039             Mean   :  16250        Mean   :0.55       
##  3rd Qu.:0.000             3rd Qu.:  15590        3rd Qu.:0.88       
##  Max.   :7.000             Max.   :1435667        Max.   :5.95       
##  NA's   :7604              NA's   :7604           NA's   :7604       
##  AvailableBankcardCredit  TotalTrades     TradesNeverDelinquent.per
##  Min.   :     0          Min.   :  0.00   Min.   :0.000            
##  1st Qu.:   253          1st Qu.: 11.00   1st Qu.:0.690            
##  Median :  2277          Median : 18.00   Median :0.870            
##  Mean   : 10460          Mean   : 20.48   Mean   :0.807            
##  3rd Qu.: 10162          3rd Qu.: 28.00   3rd Qu.:1.000            
##  Max.   :646285          Max.   :126.00   Max.   :1.000            
##  NA's   :7544            NA's   :7544     NA's   :7544             
##  TradesOpenedLast6Months DebtToIncomeRatio         IncomeRange  
##  Min.   : 0.000          Min.   : 0.0000   $25,000-49,999:8017  
##  1st Qu.: 0.000          1st Qu.: 0.1200   Not displayed :7741  
##  Median : 1.000          Median : 0.2000   $50,000-74,999:5423  
##  Mean   : 1.088          Mean   : 0.3239   $1-24,999     :2620  
##  3rd Qu.: 2.000          3rd Qu.: 0.3000   $75,000-99,999:2418  
##  Max.   :17.000          Max.   :10.0100   $100,000+     :2132  
##  NA's   :7544            NA's   :1258      (Other)       : 733  
##  IncomeVerifiable StatedMonthlyIncome                    LoanKey     
##  Mode :logical    Min.   :     0      00013421083473792D70F75:    1  
##  FALSE:1336       1st Qu.:  2500      000534180797040005C07AA:    1  
##  TRUE :27748      Median :  3833      00093413855467649508680:    1  
##                   Mean   :  4665      000B3366346245964D6187E:    1  
##                   3rd Qu.:  5752      000B34179327090460D3429:    1  
##                   Max.   :208333      000E3392089465002A7DBA0:    1  
##                                       (Other)                :29078  
##  TotalProsperLoans TotalProsperPaymentsBilled OnTimeProsperPayments
##  Min.   :1.000     Min.   : 0.00              Min.   : 0.00        
##  1st Qu.:1.000     1st Qu.: 7.00              1st Qu.: 6.00        
##  Median :1.000     Median :10.00              Median :10.00        
##  Mean   :1.079     Mean   :11.09              Mean   :10.87        
##  3rd Qu.:1.000     3rd Qu.:14.00              3rd Qu.:14.00        
##  Max.   :5.000     Max.   :42.00              Max.   :41.00        
##  NA's   :26796     NA's   :26796              NA's   :26796        
##  ProsperPaymentsLessThanOneMonthLate ProsperPaymentsOneMonthPlusLate
##  Min.   :0.000                       Min.   :0.000                  
##  1st Qu.:0.000                       1st Qu.:0.000                  
##  Median :0.000                       Median :0.000                  
##  Mean   :0.205                       Mean   :0.011                  
##  3rd Qu.:0.000                       3rd Qu.:0.000                  
##  Max.   :7.000                       Max.   :5.000                  
##  NA's   :26796                       NA's   :26796                  
##  ProsperPrincipalBorrowed ProsperPrincipalOutstanding
##  Min.   : 1000            Min.   :    0              
##  1st Qu.: 2550            1st Qu.:    0              
##  Median : 4500            Median : 1970              
##  Mean   : 6012            Mean   : 3027              
##  3rd Qu.: 7500            3rd Qu.: 4145              
##  Max.   :40000            Max.   :21862              
##  NA's   :26796            NA's   :26796              
##  ScorexChangeAtTimeOfListing LoanCurrentDaysDelinquent
##  Min.   :-160.000            Min.   :   0.0           
##  1st Qu.:   0.000            1st Qu.:   0.0           
##  Median :   0.000            Median :   0.0           
##  Mean   :   7.363            Mean   : 491.8           
##  3rd Qu.:  40.000            3rd Qu.: 948.2           
##  Max.   : 215.000            Max.   :2704.0           
##  NA's   :26798                                        
##  LoanFirstDefaultedCycleNumber LoanMonthsSinceOrigination   LoanNumber   
##  Min.   : 0.00                 Min.   : 39.00             Min.   :    1  
##  1st Qu.:10.00                 1st Qu.: 70.00             1st Qu.: 7395  
##  Median :16.00                 Median : 78.00             Median :19450  
##  Mean   :17.32                 Mean   : 78.21             Mean   :19418  
##  3rd Qu.:24.00                 3rd Qu.: 85.00             3rd Qu.:30463  
##  Max.   :44.00                 Max.   :100.00             Max.   :46378  
##  NA's   :18376                                                           
##  LoanOriginalAmount LoanOriginationDate  LoanOriginationQuarter
##  Min.   : 1000      Min.   :2005-11-15   Q2 2008: 4344         
##  1st Qu.: 2500      1st Qu.:2007-02-13   Q3 2008: 3602         
##  Median : 4500      Median :2007-09-21   Q2 2007: 3118         
##  Mean   : 6159      Mean   :2007-09-09   Q1 2007: 3079         
##  3rd Qu.: 7904      3rd Qu.:2008-05-02   Q1 2008: 3074         
##  Max.   :25000      Max.   :2010-12-30   (Other):11845         
##                                          NA's   :   22         
##                    MemberKey     MonthlyLoanPayment LP_CustomerPayments
##  3EF133647645155044BFFD9:    6   Min.   :   0.00    Min.   :    0      
##  7E1733653050264822FAA3D:    6   1st Qu.:  84.84    1st Qu.: 1647      
##  16083364744933457E57FB9:    4   Median : 153.80    Median : 3778      
##  242A33660960718280E1642:    4   Mean   : 215.72    Mean   : 5683      
##  5B8333756488098823F5EFE:    4   3rd Qu.: 275.77    3rd Qu.: 7403      
##  63CA34120866140639431C9:    4   Max.   :1130.90    Max.   :40702      
##  (Other)                :29056                                         
##  LP_CustomerPrincipalPayments LP_InterestandFees LP_ServiceFees   
##  Min.   :    0                Min.   :    0.0    Min.   :-664.87  
##  1st Qu.: 1069                1st Qu.:  335.4    1st Qu.: -76.15  
##  Median : 3000                Median :  779.3    Median : -33.50  
##  Mean   : 4502                Mean   : 1180.7    Mean   : -54.97  
##  3rd Qu.: 6000                3rd Qu.: 1532.2    3rd Qu.: -13.14  
##  Max.   :25693                Max.   :15617.0    Max.   :  32.06  
##                                                                   
##  LP_CollectionFees  LP_GrossPrincipalLoss LP_NetPrincipalLoss
##  Min.   :-9274.75   Min.   :    0         Min.   : -954.5    
##  1st Qu.:    0.00   1st Qu.:    0         1st Qu.:    0.0    
##  Median :    0.00   Median :    0         Median :    0.0    
##  Mean   :  -31.86   Mean   : 1647         Mean   : 1596.6    
##  3rd Qu.:    0.00   3rd Qu.: 1863         3rd Qu.: 1748.7    
##  Max.   :    0.00   Max.   :25000         Max.   :25000.0    
##                                                              
##  LP_NonPrincipalRecoverypayments PercentFunded   Recommendations  
##  Min.   :    0.00                Min.   :1.000   Min.   : 0.0000  
##  1st Qu.:    0.00                1st Qu.:1.000   1st Qu.: 0.0000  
##  Median :    0.00                Median :1.000   Median : 0.0000  
##  Mean   :   76.19                Mean   :1.000   Mean   : 0.1369  
##  3rd Qu.:    0.00                3rd Qu.:1.000   3rd Qu.: 0.0000  
##  Max.   :21117.90                Max.   :1.011   Max.   :39.0000  
##                                                                   
##  InvestmentFromFriendsCount InvestmentFromFriendsAmount   Investors    
##  Min.   : 0.00000           Min.   :    0.00            Min.   :  1.0  
##  1st Qu.: 0.00000           1st Qu.:    0.00            1st Qu.: 34.0  
##  Median : 0.00000           Median :    0.00            Median : 78.0  
##  Mean   : 0.06842           Mean   :   52.25            Mean   :116.1  
##  3rd Qu.: 0.00000           3rd Qu.:    0.00            3rd Qu.:158.0  
##  Max.   :33.00000           Max.   :25000.00            Max.   :913.0  
## 

That is indeed the case, and as the same percentage of the other similar measures is missing, I will assume this is also the case for those measures.

I see that some borrower demographic, employment, and previous credit information is missing, but I assume that this is simply missing data, with no larger story behind it, particularly as this is a relatively small percentage of loans. I also see that more of this information is missing for loans that have been closed, which suggests to me that this data was either lost, or not gathered as thoroughly in the past.

The majority of the borrowers in both categories have no prior Prosper history, and it would be interesting to see if, for example, not having any Prosper history leads to more delinquencies than having positive Prosper history.

Most loans were not charged off, but about 30% of closed loans at least at some point became delinquent (LoanFirstDefaultedCycleNumber). A very small number of open loans are delinquent.

Lender Profit

At this point, I want to take a look, through plotting correlations, at how predictive the above background, financial, or demographic measures are of measures most closely related to lender profit.

LoanStatus vs. CreditGrade/ProsperRating

In the case of LoanStatus, as this is not a quantitative or clearly ordered factor, it may make sense to at least visually organize some of the levels. I therefore ‘group’ all Past Due levels together, and order the levels loosely in terms of ‘goodness’ - assuming that being on time, or having paid off the loan, is ‘good,’ and that having defaulted, or having the loan charged off, is ‘bad.’ I group CreditGrade and ProsperRating into one measure, and then plot LoanStatus by this new rating, to see if there are any obvious patterns on how likely one is to have a particular loan status, given a particular starting rating.

pastDue <- levels(data$LoanStatus)[7:12]

plot_data <- data %>% 
  mutate(LoanStatus = fct_recode(LoanStatus, "PastDue" = pastDue[1], "PastDue" = pastDue[2], "PastDue" = pastDue[3], "PastDue" = pastDue[4], "PastDue" = pastDue[5], "PastDue" = pastDue[6])) %>%
  mutate(Rating = coalesce(CreditGrade, ProsperRating.alpha)) %>%
  mutate(LoanStatus = ordered(LoanStatus, c("Defaulted","Chargedoff","PastDue","Cancelled","Current","FinalPaymentInProgress","Completed"))) %>%
  group_by(Rating, LoanStatus) %>% 
  tally %>% 
  mutate(percent = n/sum(n))

ggplot(plot_data, aes(x = LoanStatus, y = percent)) +
  geom_bar(stat = "identity") +
  geom_text(aes(label = percent(percent)), hjust = -0.25) +
  labs(title = "Loan Status by Rating", y = "Percent", x = "Loan Status") +
  scale_y_continuous(labels = percent, limits = c(0,1)) +
  scale_x_discrete(labels = function(x) str_wrap(x, 10)) +
  facet_wrap(~Rating) +
  coord_flip()

What I see here is that the higher the rating, the greater the likelihood that the loan is either completed or current, and the less the likelihood that it is past due, charged off, or defaulted. Overall, it seems that a customer with a higher rating at the time the loan is posted will indeed be more likely to pay off a loan in the future.

Exploring Profit Measures

First, I want to get a sense of when these measures might be getting assigned, in cases where documentation does not make this clear. To make this more clear, I will look at loans which have not been closed, and see if they systematically include this information (compared to loans which are closed). If they do, it’s relatively safe to say that these measures are predictions, rather than reports of actual yield.

summary(filter(data, is.na(ClosedDate)))
##                    ListingKey    ListingNumber     ListingCreationDate 
##  17A93590655669644DB4C06:    6   Min.   : 464139   Min.   :2010-06-24  
##  349D3587495831350F0F648:    4   1st Qu.: 682358   1st Qu.:2012-12-04  
##  47C1359638497431975670B:    4   Median : 875238   Median :2013-08-20  
##  8474358854651984137201C:    4   Mean   : 870182   Mean   :2013-05-16  
##  DE8535960513435199406CE:    4   3rd Qu.:1051465   3rd Qu.:2013-12-05  
##  04C13599434217079754AEE:    3   Max.   :1255725   Max.   :2014-03-10  
##  (Other)                :58823                                         
##   CreditGrade         Term                        LoanStatus   
##  NC     :    0   Min.   :12.00   Current               :56576  
##  HR     :    0   1st Qu.:36.00   Past Due (1-15 days)  :  806  
##  E      :    0   Median :36.00   Past Due (31-60 days) :  363  
##  D      :    0   Mean   :44.47   Past Due (61-90 days) :  313  
##  C      :    0   3rd Qu.:60.00   Past Due (91-120 days):  304  
##  (Other):    0   Max.   :60.00   Past Due (16-30 days) :  265  
##  NA's   :58848                   (Other)               :  221  
##    ClosedDate     BorrowerAPR       BorrowerRate     LenderYield    
##  Min.   :NA      Min.   :0.06106   Min.   :0.0577   Min.   :0.0477  
##  1st Qu.:NA      1st Qu.:0.16056   1st Qu.:0.1334   1st Qu.:0.1234  
##  Median :NA      Median :0.20679   Median :0.1769   Median :0.1669  
##  Mean   :NA      Mean   :0.21568   Mean   :0.1856   Mean   :0.1756  
##  3rd Qu.:NA      3rd Qu.:0.26877   3rd Qu.:0.2346   3rd Qu.:0.2246  
##  Max.   :NA      Max.   :0.38486   Max.   :0.3435   Max.   :0.3335  
##  NA's   :58848                                                      
##  EstimatedEffectiveYield EstimatedLoss     EstimatedReturn  
##  Min.   :0.0474          Min.   :0.00490   Min.   :0.03700  
##  1st Qu.:0.1181          1st Qu.:0.04200   1st Qu.:0.07400  
##  Median :0.1575          Median :0.06490   Median :0.08728  
##  Mean   :0.1653          Mean   :0.07435   Mean   :0.09100  
##  3rd Qu.:0.2086          3rd Qu.:0.10250   3rd Qu.:0.10790  
##  Max.   :0.3057          Max.   :0.20300   Max.   :0.17610  
##                                                             
##  ProsperRating.num ProsperRating.alpha  ProsperScore   ListingCategory.num
##  Min.   :1.000     C      :14528       Min.   : 1.00   Min.   : 0.000     
##  1st Qu.:3.000     B      :12208       1st Qu.: 4.00   1st Qu.: 1.000     
##  Median :4.000     A      :10943       Median : 6.00   Median : 1.000     
##  Mean   :4.253     D      : 8405       Mean   : 5.81   Mean   : 3.118     
##  3rd Qu.:5.000     E      : 5965       3rd Qu.: 8.00   3rd Qu.: 2.000     
##  Max.   :7.000     AA     : 3589       Max.   :11.00   Max.   :20.000     
##                    (Other): 3210                                          
##  BorrowerState                 Occupation         EmploymentStatus
##  CA     : 7454   Other              :14561   Employed     :50831  
##  NY     : 4214   Professional       : 7113   Self-employed: 3208  
##  TX     : 4090   Executive          : 2522   Other        : 3008  
##  FL     : 3642   Teacher            : 2111   Full-time    : 1397  
##  IL     : 2882   Computer Programmer: 1984   Not employed :  274  
##  OH     : 2389   (Other)            :29237   Retired      :   98  
##  (Other):34177   NA's               : 1320   (Other)      :   32  
##  EmploymentStatusDuration IsBorrowerHomeowner CurrentlyInGroup
##  Min.   :  0.0            Mode :logical       Mode :logical   
##  1st Qu.: 32.0            FALSE:27257         FALSE:57973     
##  Median : 79.0            TRUE :31591         TRUE :875       
##  Mean   :108.3                                                
##  3rd Qu.:156.0                                                
##  Max.   :733.0                                                
##  NA's   :10                                                   
##                     GroupKey     DateCreditPulled    
##  3D4D3366260257624AB272D:  110   Min.   :2008-01-23  
##  783C3371218786870A73D20:   79   1st Qu.:2012-12-03  
##  52EA3425051368132B80C96:   41   Median :2013-08-22  
##  FEF83377364176536637E50:   29   Mean   :2013-05-17  
##  6A3B336601725506917317E:   26   3rd Qu.:2013-12-05  
##  (Other)                :  387   Max.   :2014-03-10  
##  NA's                   :58176                       
##  CreditScoreRangeLower CreditScoreRangeUpper FirstRecordedCreditLine
##  Min.   :600.0         Min.   :619.0         Min.   :1951-01-01     
##  1st Qu.:660.0         1st Qu.:679.0         1st Qu.:1990-03-01     
##  Median :700.0         Median :719.0         Median :1995-11-22     
##  Mean   :698.4         Mean   :717.4         Mean   :1994-11-04     
##  3rd Qu.:720.0         3rd Qu.:739.0         3rd Qu.:2000-05-11     
##  Max.   :880.0         Max.   :899.0         Max.   :2012-12-22     
##                                                                     
##  CurrentCreditLines OpenCreditLines TotalCreditLinespast7years
##  Min.   : 0.00      Min.   : 0      Min.   :  2.00            
##  1st Qu.: 7.00      1st Qu.: 7      1st Qu.: 19.00            
##  Median :10.00      Median : 9      Median : 27.00            
##  Mean   :10.92      Mean   :10      Mean   : 28.12            
##  3rd Qu.:14.00      3rd Qu.:13      3rd Qu.: 36.00            
##  Max.   :54.00      Max.   :54      Max.   :125.00            
##                                                               
##  OpenRevolvingAccounts OpenRevolvingMonthlyPayment InquiriesLast6Months
##  Min.   : 0.000        Min.   :    0.0             Min.   : 0.0000     
##  1st Qu.: 5.000        1st Qu.:  188.0             1st Qu.: 0.0000     
##  Median : 7.000        Median :  344.0             Median : 0.0000     
##  Mean   : 7.805        Mean   :  466.6             Mean   : 0.8649     
##  3rd Qu.:10.000        3rd Qu.:  606.0             3rd Qu.: 1.0000     
##  Max.   :50.000        Max.   :13765.0             Max.   :15.0000     
##                                                                        
##  TotalInquiries   CurrentDelinquencies AmountDelinquent
##  Min.   : 0.000   Min.   : 0.0000      Min.   :     0  
##  1st Qu.: 2.000   1st Qu.: 0.0000      1st Qu.:     0  
##  Median : 3.000   Median : 0.0000      Median :     0  
##  Mean   : 4.134   Mean   : 0.3015      Mean   :   931  
##  3rd Qu.: 6.000   3rd Qu.: 0.0000      3rd Qu.:     0  
##  Max.   :78.000   Max.   :51.0000      Max.   :463881  
##                                                        
##  DelinquenciesLast7Years PublicRecordsLast10Years
##  Min.   : 0.000          Min.   : 0.0000         
##  1st Qu.: 0.000          1st Qu.: 0.0000         
##  Median : 0.000          Median : 0.0000         
##  Mean   : 3.772          Mean   : 0.2956         
##  3rd Qu.: 2.000          3rd Qu.: 0.0000         
##  Max.   :99.000          Max.   :38.0000         
##                                                  
##  PublicRecordsLast12Months RevolvingCreditBalance BankcardUtilization
##  Min.   : 0.00000          Min.   :     0         Min.   :0.0000     
##  1st Qu.: 0.00000          1st Qu.:  4736         1st Qu.:0.3700     
##  Median : 0.00000          Median : 10388         Median :0.6200     
##  Mean   : 0.00814          Mean   : 19140         Mean   :0.5862     
##  3rd Qu.: 0.00000          3rd Qu.: 21972         3rd Qu.:0.8300     
##  Max.   :20.00000          Max.   :999165         Max.   :1.8200     
##                                                                      
##  AvailableBankcardCredit  TotalTrades    TradesNeverDelinquent.per
##  Min.   :     0          Min.   :  1.0   Min.   :0.0800           
##  1st Qu.:  1296          1st Qu.: 16.0   1st Qu.:0.8500           
##  Median :  4727          Median : 23.0   Median :0.9600           
##  Mean   : 11506          Mean   : 24.4   Mean   :0.9097           
##  3rd Qu.: 14111          3rd Qu.: 31.0   3rd Qu.:1.0000           
##  Max.   :498374          Max.   :108.0   Max.   :1.0000           
##                                                                   
##  TradesOpenedLast6Months DebtToIncomeRatio         IncomeRange   
##  Min.   : 0.0000         Min.   : 0.000    $50,000-74,999:18261  
##  1st Qu.: 0.0000         1st Qu.: 0.160    $25,000-49,999:15848  
##  Median : 0.0000         Median : 0.230    $100,000+     :11273  
##  Mean   : 0.7159         Mean   : 0.263    $75,000-99,999:10474  
##  3rd Qu.: 1.0000         3rd Qu.: 0.320    $1-24,999     : 2703  
##  Max.   :16.0000         Max.   :10.010    Not employed  :  274  
##                          NA's   :4324      (Other)       :   15  
##  IncomeVerifiable StatedMonthlyIncome                    LoanKey     
##  Mode :logical    Min.   :      0     CB1B37030986463208432A1:    6  
##  FALSE:4368       1st Qu.:   3617     2DEE3698211017519D7333F:    4  
##  TRUE :54480      Median :   5167     9F4B37043517554537C364C:    4  
##                   Mean   :   6126     D895370150591392337ED6D:    4  
##                   3rd Qu.:   7417     E6FB37073953690388BC56D:    4  
##                   Max.   :1750003     0D8F37036734373301ED419:    3  
##                                       (Other)                :58823  
##  TotalProsperLoans TotalProsperPaymentsBilled OnTimeProsperPayments
##  Min.   :1.0       Min.   :  0.00             Min.   :  0.00       
##  1st Qu.:1.0       1st Qu.: 10.00             1st Qu.: 10.00       
##  Median :1.0       Median : 17.00             Median : 17.00       
##  Mean   :1.5       Mean   : 25.54             Mean   : 24.81       
##  3rd Qu.:2.0       3rd Qu.: 35.00             3rd Qu.: 35.00       
##  Max.   :8.0       Max.   :141.00             Max.   :141.00       
##  NA's   :47302     NA's   :47302              NA's   :47302        
##  ProsperPaymentsLessThanOneMonthLate ProsperPaymentsOneMonthPlusLate
##  Min.   : 0.00                       Min.   : 0.00                  
##  1st Qu.: 0.00                       1st Qu.: 0.00                  
##  Median : 0.00                       Median : 0.00                  
##  Mean   : 0.68                       Mean   : 0.05                  
##  3rd Qu.: 0.00                       3rd Qu.: 0.00                  
##  Max.   :42.00                       Max.   :21.00                  
##  NA's   :47302                       NA's   :47302                  
##  ProsperPrincipalBorrowed ProsperPrincipalOutstanding
##  Min.   : 1000            Min.   :    0.00           
##  1st Qu.: 4000            1st Qu.:    0.01           
##  Median : 7400            Median : 2213.24           
##  Mean   : 9721            Mean   : 3475.83           
##  3rd Qu.:13500            3rd Qu.: 5204.00           
##  Max.   :72499            Max.   :23450.95           
##  NA's   :47302            NA's   :47302              
##  ScorexChangeAtTimeOfListing LoanCurrentDaysDelinquent
##  Min.   :-209.0              Min.   :  0.000          
##  1st Qu.: -38.0              1st Qu.:  0.000          
##  Median :  -9.0              Median :  0.000          
##  Mean   :  -8.6              Mean   :  1.468          
##  3rd Qu.:  18.0              3rd Qu.:  0.000          
##  Max.   : 220.0              Max.   :129.000          
##  NA's   :50362                                        
##  LoanFirstDefaultedCycleNumber LoanMonthsSinceOrigination   LoanNumber    
##  Min.   : 1.00                 Min.   : 0.00              Min.   : 43212  
##  1st Qu.: 1.00                 1st Qu.: 3.00              1st Qu.: 79386  
##  Median : 7.50                 Median : 7.00              Median :100276  
##  Mean   :11.88                 Mean   : 9.68              Mean   : 98941  
##  3rd Qu.:17.25                 3rd Qu.:15.00              3rd Qu.:121614  
##  Max.   :38.00                 Max.   :45.00              Max.   :136486  
##  NA's   :58840                                                            
##  LoanOriginalAmount LoanOriginationDate  LoanOriginationQuarter
##  Min.   : 1500      Min.   :2010-06-30   Q4 2013:14058         
##  1st Qu.: 4000      1st Qu.:2012-12-18   Q1 2014:12103         
##  Median :10000      Median :2013-08-29   Q3 2013: 8592         
##  Mean   :10280      Mean   :2013-05-27   Q2 2013: 6268         
##  3rd Qu.:15000      3rd Qu.:2013-12-16   Q3 2012: 3419         
##  Max.   :35000      Max.   :2014-03-12   Q4 2012: 3022         
##                                          (Other):11386         
##                    MemberKey     MonthlyLoanPayment LP_CustomerPayments
##  F80D3694083622957BA09F2:    6   Min.   :   0.0     Min.   :    0      
##  0F0C35762146892131F3BB4:    4   1st Qu.: 166.6     1st Qu.:  555      
##  22B53699795042922A27DCC:    4   Median : 286.9     Median : 1516      
##  61E93477058090904D07D4F:    4   Mean   : 318.1     Mean   : 2550      
##  946A35068649687154063A9:    4   3rd Qu.: 415.1     3rd Qu.: 3367      
##  EA463494084516244B9C542:    4   Max.   :2163.6     Max.   :31613      
##  (Other)                :58822                                         
##  LP_CustomerPrincipalPayments LP_InterestandFees LP_ServiceFees   
##  Min.   :    0.0              Min.   :    0.0    Min.   :-564.85  
##  1st Qu.:  286.8              1st Qu.:  221.7    1st Qu.: -73.29  
##  Median :  795.5              Median :  640.4    Median : -34.75  
##  Mean   : 1519.1              Mean   : 1031.3    Mean   : -55.72  
##  3rd Qu.: 1872.4              3rd Qu.: 1410.9    3rd Qu.: -13.11  
##  Max.   :30831.1              Max.   :10572.8    Max.   :   0.77  
##                                                                   
##  LP_CollectionFees   LP_GrossPrincipalLoss LP_NetPrincipalLoss
##  Min.   :-1242.460   Min.   :0             Min.   :0          
##  1st Qu.:    0.000   1st Qu.:0             1st Qu.:0          
##  Median :    0.000   Median :0             Median :0          
##  Mean   :   -4.171   Mean   :0             Mean   :0          
##  3rd Qu.:    0.000   3rd Qu.:0             3rd Qu.:0          
##  Max.   :    0.000   Max.   :0             Max.   :0          
##                                                               
##  LP_NonPrincipalRecoverypayments PercentFunded    Recommendations    
##  Min.   :0                       Min.   :0.7000   Min.   : 0.000000  
##  1st Qu.:0                       1st Qu.:1.0000   1st Qu.: 0.000000  
##  Median :0                       Median :1.0000   Median : 0.000000  
##  Mean   :0                       Mean   :0.9986   Mean   : 0.009312  
##  3rd Qu.:0                       3rd Qu.:1.0000   3rd Qu.: 0.000000  
##  Max.   :0                       Max.   :1.0125   Max.   :19.000000  
##                                                                      
##  InvestmentFromFriendsCount InvestmentFromFriendsAmount   Investors     
##  Min.   :0.00000            Min.   :   0.0000           Min.   :  1.00  
##  1st Qu.:0.00000            1st Qu.:   0.0000           1st Qu.:  1.00  
##  Median :0.00000            Median :   0.0000           Median :  8.00  
##  Mean   :0.00226            Mean   :   0.6037           Mean   : 57.62  
##  3rd Qu.:0.00000            3rd Qu.:   0.0000           3rd Qu.: 79.00  
##  Max.   :6.00000            Max.   :3000.0000           Max.   :779.00  
## 

All of these open loans have non-zero values assigned to the following measures, suggesting that these measures are predictive rather than descriptive of actual outcomes: LenderYield, EstimatedEffectiveYield, EstimatedLoss, EstimatedReturn. On the other hand, many open loans have zero values assigned for these profit measures: LP_CustomerPayments, LP_CustomerPrincipalPayments, LP_InterestandFees, LP_ServiceFees, LP_CollectionFees, LP_GrossPrincipalLoss, LP_NetPrincipalLoss, and LP_NonPrincipalRecoverypayments (in fact, the last 3 have only zero values assigned). These I will take a closer look at.

summary(filter(data, LP_CustomerPayments==0))
##                    ListingKey   ListingNumber     ListingCreationDate 
##  8474358854651984137201C:   4   Min.   :    908   Min.   :2006-02-28  
##  04C13599434217079754AEE:   3   1st Qu.:1159590   1st Qu.:2014-02-03  
##  0A0635972629771021E38F3:   3   Median :1188042   Median :2014-02-14  
##  26C835968174004476E551B:   3   Mean   :1122524   Mean   :2013-10-02  
##  78D835971025680406A3489:   3   3rd Qu.:1213931   3rd Qu.:2014-02-25  
##  873E36032681397836823F7:   3   Max.   :1255725   Max.   :2014-03-10  
##  (Other)                :6189                                         
##   CreditGrade        Term                       LoanStatus  
##  HR     : 125   Min.   :12.00   Current              :5695  
##  E      :  54   1st Qu.:36.00   Chargedoff           : 267  
##  C      :  32   Median :36.00   Defaulted            : 199  
##  D      :  26   Mean   :43.22   Completed            :  10  
##  B      :  22   3rd Qu.:60.00   Past Due (31-60 days):   7  
##  (Other):  18   Max.   :60.00   Past Due (61-90 days):   7  
##  NA's   :5931                   (Other)              :  23  
##    ClosedDate          BorrowerAPR       BorrowerRate     LenderYield     
##  Min.   :2006-03-29   Min.   :0.00864   Min.   :0.0021   Min.   :-0.0029  
##  1st Qu.:2007-08-13   1st Qu.:0.14243   1st Qu.:0.1159   1st Qu.: 0.1059  
##  Median :2008-12-31   Median :0.18222   Median :0.1500   Median : 0.1400  
##  Mean   :2009-11-19   Mean   :0.18989   Mean   :0.1613   Mean   : 0.1512  
##  3rd Qu.:2012-05-31   3rd Qu.:0.22301   3rd Qu.:0.1920   3rd Qu.: 0.1820  
##  Max.   :2014-03-03   Max.   :0.42395   Max.   :0.3600   Max.   : 0.3400  
##  NA's   :5727                                                             
##  EstimatedEffectiveYield EstimatedLoss     EstimatedReturn   
##  Min.   :-0.01660        Min.   :0.00600   Min.   :-0.01660  
##  1st Qu.: 0.09989        1st Qu.:0.03490   1st Qu.: 0.06491  
##  Median : 0.12898        Median :0.05740   Median : 0.07349  
##  Mean   : 0.13745        Mean   :0.06419   Mean   : 0.07387  
##  3rd Qu.: 0.16463        3rd Qu.:0.08490   3rd Qu.: 0.08027  
##  Max.   : 0.30570        Max.   :0.25000   Max.   : 0.19100  
##  NA's   :277             NA's   :277       NA's   :277       
##  ProsperRating.num ProsperRating.alpha  ProsperScore   
##  Min.   :1.000     C      :1634        Min.   : 1.000  
##  1st Qu.:4.000     A      :1354        1st Qu.: 4.000  
##  Median :5.000     B      :1302        Median : 6.000  
##  Mean   :4.637     D      : 552        Mean   : 5.971  
##  3rd Qu.:6.000     AA     : 523        3rd Qu.: 8.000  
##  Max.   :7.000     (Other): 566        Max.   :11.000  
##  NA's   :277       NA's   : 277        NA's   :277     
##  ListingCategory.num BorrowerState                Occupation  
##  1      :4548        CA     : 851   Other              :1350  
##  7      : 483        TX     : 427   Professional       : 722  
##  2      : 287        NY     : 410   Executive          : 260  
##  0      : 193        FL     : 386   Computer Programmer: 213  
##  3      : 180        IL     : 301   Teacher            : 195  
##  15     : 109        (Other):3814   (Other)            :3127  
##  (Other): 408        NA's   :  19   NA's               : 341  
##       EmploymentStatus EmploymentStatusDuration IsBorrowerHomeowner
##  Employed     :5087    Min.   :  0.00           Mode :logical      
##  Self-employed: 438    1st Qu.: 26.25           FALSE:3006         
##  Other        : 365    Median : 77.00           TRUE :3202         
##  Full-time    : 192    Mean   :104.79                              
##  Not available:  79    3rd Qu.:153.00                              
##  (Other)      :  16    Max.   :649.00                              
##  NA's         :  31    NA's   :110                                 
##  CurrentlyInGroup                    GroupKey    DateCreditPulled    
##  Mode :logical    6A3B336601725506917317E:  15   Min.   :2006-02-16  
##  FALSE:6052       3D4D3366260257624AB272D:  12   1st Qu.:2014-02-04  
##  TRUE :156        783C3371218786870A73D20:  10   Median :2014-02-14  
##                   FEF83377364176536637E50:   9   Mean   :2013-10-03  
##                   F3BE336490588367617A2BA:   7   3rd Qu.:2014-02-24  
##                   (Other)                :  84   Max.   :2014-03-10  
##                   NA's                   :6071                       
##  CreditScoreRangeLower CreditScoreRangeUpper FirstRecordedCreditLine
##  Min.   :  0.0         Min.   : 19.0         Min.   :1953-12-01     
##  1st Qu.:660.0         1st Qu.:679.0         1st Qu.:1990-07-01     
##  Median :680.0         Median :699.0         Median :1996-07-29     
##  Mean   :688.2         Mean   :707.2         Mean   :1995-06-11     
##  3rd Qu.:720.0         3rd Qu.:739.0         3rd Qu.:2001-02-12     
##  Max.   :840.0         Max.   :859.0         Max.   :2012-09-17     
##  NA's   :1             NA's   :1             NA's   :3              
##  CurrentCreditLines OpenCreditLines TotalCreditLinespast7years
##  Min.   : 0.00      Min.   : 0.0    Min.   :  2.00            
##  1st Qu.: 8.00      1st Qu.: 7.0    1st Qu.: 19.00            
##  Median :10.00      Median :10.0    Median : 26.00            
##  Mean   :11.34      Mean   :10.5    Mean   : 27.95            
##  3rd Qu.:14.00      3rd Qu.:13.0    3rd Qu.: 35.00            
##  Max.   :54.00      Max.   :54.0    Max.   :125.00            
##  NA's   :110        NA's   :110     NA's   :3                 
##  OpenRevolvingAccounts OpenRevolvingMonthlyPayment InquiriesLast6Months
##  Min.   : 0.000        Min.   :   0.0              Min.   : 0.000      
##  1st Qu.: 5.000        1st Qu.: 196.8              1st Qu.: 0.000      
##  Median : 7.000        Median : 363.5              Median : 1.000      
##  Mean   : 8.115        Mean   : 500.0              Mean   : 1.073      
##  3rd Qu.:10.000        3rd Qu.: 666.0              3rd Qu.: 1.000      
##  Max.   :50.000        Max.   :7090.0              Max.   :53.000      
##                                                    NA's   :3           
##  TotalInquiries   CurrentDelinquencies AmountDelinquent  
##  Min.   : 0.000   Min.   : 0.0000      Min.   :     0.0  
##  1st Qu.: 2.000   1st Qu.: 0.0000      1st Qu.:     0.0  
##  Median : 4.000   Median : 0.0000      Median :     0.0  
##  Mean   : 4.863   Mean   : 0.4219      Mean   :   679.2  
##  3rd Qu.: 7.000   3rd Qu.: 0.0000      3rd Qu.:     0.0  
##  Max.   :70.000   Max.   :83.0000      Max.   :183396.0  
##  NA's   :7        NA's   :3            NA's   :111       
##  DelinquenciesLast7Years PublicRecordsLast10Years
##  Min.   : 0.000          Min.   : 0.0000         
##  1st Qu.: 0.000          1st Qu.: 0.0000         
##  Median : 0.000          Median : 0.0000         
##  Mean   : 3.539          Mean   : 0.2967         
##  3rd Qu.: 1.000          3rd Qu.: 0.0000         
##  Max.   :99.000          Max.   :38.0000         
##  NA's   :6               NA's   :3               
##  PublicRecordsLast12Months RevolvingCreditBalance BankcardUtilization
##  Min.   :0.00000           Min.   :     0         Min.   :0.0000     
##  1st Qu.:0.00000           1st Qu.:  5154         1st Qu.:0.3600     
##  Median :0.00000           Median : 11390         Median :0.6100     
##  Mean   :0.00508           Mean   : 21079         Mean   :0.5784     
##  3rd Qu.:0.00000           3rd Qu.: 24664         3rd Qu.:0.8200     
##  Max.   :2.00000           Max.   :976426         Max.   :1.9000     
##  NA's   :110               NA's   :110            NA's   :110        
##  AvailableBankcardCredit  TotalTrades     TradesNeverDelinquent.per
##  Min.   :     0          Min.   :  1.00   Min.   :0.0000           
##  1st Qu.:  1762          1st Qu.: 16.00   1st Qu.:0.8800           
##  Median :  5798          Median : 23.00   Median :0.9700           
##  Mean   : 13349          Mean   : 24.48   Mean   :0.9195           
##  3rd Qu.: 17087          3rd Qu.: 31.00   3rd Qu.:1.0000           
##  Max.   :221237          Max.   :108.00   Max.   :1.0000           
##  NA's   :110             NA's   :110      NA's   :110              
##  TradesOpenedLast6Months DebtToIncomeRatio         IncomeRange  
##  Min.   : 0.0000         Min.   : 0.0100   $50,000-74,999:1889  
##  1st Qu.: 0.0000         1st Qu.: 0.1700   $25,000-49,999:1582  
##  Median : 1.0000         Median : 0.2400   $100,000+     :1281  
##  Mean   : 0.8129         Mean   : 0.2619   $75,000-99,999:1104  
##  3rd Qu.: 1.0000         3rd Qu.: 0.3300   $1-24,999     : 232  
##  Max.   :12.0000         Max.   :10.0100   Not displayed : 113  
##  NA's   :110             NA's   :458       (Other)       :   7  
##  IncomeVerifiable StatedMonthlyIncome                    LoanKey    
##  Mode :logical    Min.   :    0       E6FB37073953690388BC56D:   4  
##  FALSE:459        1st Qu.: 3723       10D33705822704973E703BB:   3  
##  TRUE :5749       Median : 5313       1C10370687519959757D4E0:   3  
##                   Mean   : 6179       50F23708735181834951669:   3  
##                   3rd Qu.: 7542       547237051355919565459AB:   3  
##                   Max.   :70833       5D463706577381028D227CB:   3  
##                                       (Other)                :6189  
##  TotalProsperLoans TotalProsperPaymentsBilled OnTimeProsperPayments
##  Min.   :1.000     Min.   :  1.00             Min.   :  0.00       
##  1st Qu.:1.000     1st Qu.: 10.00             1st Qu.: 10.00       
##  Median :1.000     Median : 18.00             Median : 17.00       
##  Mean   :1.556     Mean   : 25.45             Mean   : 24.87       
##  3rd Qu.:2.000     3rd Qu.: 35.00             3rd Qu.: 33.00       
##  Max.   :7.000     Max.   :131.00             Max.   :131.00       
##  NA's   :5564      NA's   :5564               NA's   :5564         
##  ProsperPaymentsLessThanOneMonthLate ProsperPaymentsOneMonthPlusLate
##  Min.   : 0.000                      Min.   :0.000                  
##  1st Qu.: 0.000                      1st Qu.:0.000                  
##  Median : 0.000                      Median :0.000                  
##  Mean   : 0.562                      Mean   :0.022                  
##  3rd Qu.: 0.000                      3rd Qu.:0.000                  
##  Max.   :26.000                      Max.   :3.000                  
##  NA's   :5564                        NA's   :5564                   
##  ProsperPrincipalBorrowed ProsperPrincipalOutstanding
##  Min.   : 1000            Min.   :    0              
##  1st Qu.: 4000            1st Qu.:    0              
##  Median : 9700            Median : 2258              
##  Mean   :11828            Mean   : 4173              
##  3rd Qu.:15000            3rd Qu.: 6902              
##  Max.   :63000            Max.   :22364              
##  NA's   :5564             NA's   :5564               
##  ScorexChangeAtTimeOfListing LoanCurrentDaysDelinquent
##  Min.   :-121.000            Min.   :   0.00          
##  1st Qu.: -38.000            1st Qu.:   0.00          
##  Median :  -1.000            Median :   0.00          
##  Mean   :  -5.291            Mean   :  60.99          
##  3rd Qu.:  22.500            3rd Qu.:   0.00          
##  Max.   : 117.000            Max.   :2599.00          
##  NA's   :6129                                         
##  LoanFirstDefaultedCycleNumber LoanMonthsSinceOrigination   LoanNumber    
##  Min.   : 1.000                Min.   : 0.00              Min.   :   125  
##  1st Qu.: 5.000                1st Qu.: 0.00              1st Qu.:131442  
##  Median : 5.000                Median : 1.00              Median :132978  
##  Mean   : 5.263                Mean   : 5.08              Mean   :125607  
##  3rd Qu.: 5.000                3rd Qu.: 1.00              3rd Qu.:134493  
##  Max.   :24.000                Max.   :96.00              Max.   :136486  
##  NA's   :5745                                                             
##  LoanOriginalAmount LoanOriginationDate  LoanOriginationQuarter
##  Min.   : 1000      Min.   :2006-03-06   Q1 2014:5710          
##  1st Qu.: 5000      1st Qu.:2014-02-12   Q4 2006:  39          
##  Median :10000      Median :2014-02-21   Q1 2007:  39          
##  Mean   :11431      Mean   :2013-10-12   Q3 2006:  36          
##  3rd Qu.:15000      3rd Qu.:2014-03-04   Q2 2007:  32          
##  Max.   :35000      Max.   :2014-03-12   Q3 2012:  31          
##                                          (Other): 321          
##                    MemberKey    MonthlyLoanPayment LP_CustomerPayments
##  0F0C35762146892131F3BB4:   4   Min.   :  31.52    Min.   :0          
##  19C63381132863377E5F08A:   3   1st Qu.: 174.56    1st Qu.:0          
##  46B3370043839462265FEAF:   3   Median : 330.13    Median :0          
##  51913705343598682656AAA:   3   Mean   : 350.06    Mean   :0          
##  74353588027285527C8B32C:   3   3rd Qu.: 479.09    3rd Qu.:0          
##  744F3495355780315032650:   3   Max.   :1207.30    Max.   :0          
##  (Other)                :6189                                         
##  LP_CustomerPrincipalPayments LP_InterestandFees LP_ServiceFees     
##  Min.   :0                    Min.   :0          Min.   :-92.47000  
##  1st Qu.:0                    1st Qu.:0          1st Qu.:  0.00000  
##  Median :0                    Median :0          Median :  0.00000  
##  Mean   :0                    Mean   :0          Mean   : -0.03636  
##  3rd Qu.:0                    3rd Qu.:0          3rd Qu.:  0.00000  
##  Max.   :0                    Max.   :0          Max.   :  0.00000  
##                                                                     
##  LP_CollectionFees LP_GrossPrincipalLoss LP_NetPrincipalLoss
##  Min.   :-653.67   Min.   :    0.0       Min.   :    0.0    
##  1st Qu.:   0.00   1st Qu.:    0.0       1st Qu.:    0.0    
##  Median :   0.00   Median :    0.0       Median :    0.0    
##  Mean   :  -1.25   Mean   :  356.5       Mean   :  352.6    
##  3rd Qu.:   0.00   3rd Qu.:    0.0       3rd Qu.:    0.0    
##  Max.   :   0.00   Max.   :25000.0       Max.   :25000.0    
##                                                             
##  LP_NonPrincipalRecoverypayments PercentFunded    Recommendations   
##  Min.   :   0.000                Min.   :0.7197   Min.   :0.000000  
##  1st Qu.:   0.000                1st Qu.:1.0000   1st Qu.:0.000000  
##  Median :   0.000                Median :1.0000   Median :0.000000  
##  Mean   :   4.963                Mean   :0.9999   Mean   :0.005799  
##  3rd Qu.:   0.000                3rd Qu.:1.0000   3rd Qu.:0.000000  
##  Max.   :2440.000                Max.   :1.0000   Max.   :2.000000  
##                                                                     
##  InvestmentFromFriendsCount InvestmentFromFriendsAmount   Investors    
##  Min.   :0.00000            Min.   :   0.000            Min.   :  1.0  
##  1st Qu.:0.00000            1st Qu.:   0.000            1st Qu.:  1.0  
##  Median :0.00000            Median :   0.000            Median :  1.0  
##  Mean   :0.00145            Mean   :   1.898            Mean   : 33.7  
##  3rd Qu.:0.00000            3rd Qu.:   0.000            3rd Qu.: 18.0  
##  Max.   :1.00000            Max.   :7425.000            Max.   :745.0  
## 
summary(filter(data, !is.na(ClosedDate) & LP_CustomerPayments==0))
##                    ListingKey  ListingNumber    ListingCreationDate 
##  00AF3373975597240A81AE3:  1   Min.   :   908   Min.   :2006-02-28  
##  013433665791725254947A3:  1   1st Qu.:106927   1st Qu.:2007-03-05  
##  016D3367858315895FD1C66:  1   Median :371903   Median :2008-07-23  
##  017A35059628935388E8DE8:  1   Mean   :343341   Mean   :2009-05-27  
##  03683364814602688549341:  1   3rd Qu.:546908   3rd Qu.:2011-12-27  
##  03B03365352072616FEEA72:  1   Max.   :932346   Max.   :2013-09-26  
##  (Other)                :475                                        
##   CreditGrade       Term                        LoanStatus 
##  HR     :125   Min.   :12.00   Chargedoff            :267  
##  E      : 54   1st Qu.:36.00   Defaulted             :199  
##  C      : 32   Median :36.00   Completed             : 10  
##  D      : 26   Mean   :37.25   Cancelled             :  5  
##  B      : 22   3rd Qu.:36.00   Current               :  0  
##  (Other): 18   Max.   :60.00   FinalPaymentInProgress:  0  
##  NA's   :204                   (Other)               :  0  
##    ClosedDate          BorrowerAPR       BorrowerRate     LenderYield     
##  Min.   :2006-03-29   Min.   :0.00864   Min.   :0.0021   Min.   :-0.0029  
##  1st Qu.:2007-08-13   1st Qu.:0.24142   1st Qu.:0.2199   1st Qu.: 0.2088  
##  Median :2008-12-31   Median :0.29776   Median :0.2875   Median : 0.2700  
##  Mean   :2009-11-19   Mean   :0.28514   Mean   :0.2612   Mean   : 0.2501  
##  3rd Qu.:2012-05-31   3rd Qu.:0.35372   3rd Qu.:0.3177   3rd Qu.: 0.3077  
##  Max.   :2014-03-03   Max.   :0.42395   Max.   :0.3600   Max.   : 0.3400  
##                                                                           
##  EstimatedEffectiveYield EstimatedLoss    EstimatedReturn  
##  Min.   :-0.0166         Min.   :0.0060   Min.   :-0.0166  
##  1st Qu.: 0.1988         1st Qu.:0.1019   1st Qu.: 0.1158  
##  Median : 0.2760         Median :0.1470   Median : 0.1246  
##  Mean   : 0.2384         Mean   :0.1306   Mean   : 0.1253  
##  3rd Qu.: 0.2896         3rd Qu.:0.1650   3rd Qu.: 0.1439  
##  Max.   : 0.3057         Max.   :0.2500   Max.   : 0.1910  
##  NA's   :277             NA's   :277      NA's   :277      
##  ProsperRating.num ProsperRating.alpha  ProsperScore   
##  Min.   :1.000     HR     : 74         Min.   : 1.000  
##  1st Qu.:1.000     E      : 51         1st Qu.: 4.000  
##  Median :2.000     D      : 35         Median : 5.000  
##  Mean   :2.368     C      : 25         Mean   : 4.824  
##  3rd Qu.:3.000     B      : 14         3rd Qu.: 6.000  
##  Max.   :7.000     (Other):  5         Max.   :10.000  
##  NA's   :277       NA's   :277         NA's   :277     
##  ListingCategory.num BorrowerState                    Occupation 
##  0      :192         CA     : 66   Other                   :125  
##  1      :113         TX     : 38   Professional            : 33  
##  7      : 67         FL     : 35   Clerical                : 26  
##  2      : 25         IL     : 32   Administrative Assistant: 22  
##  3      : 21         GA     : 28   Sales - Commission      : 17  
##  4      : 14         (Other):263   (Other)                 :227  
##  (Other): 49         NA's   : 19   NA's                    : 31  
##       EmploymentStatus EmploymentStatusDuration IsBorrowerHomeowner
##  Full-time    :166     Min.   :  0.0            Mode :logical      
##  Employed     :152     1st Qu.: 15.0            FALSE:331          
##  Not available: 79     Median : 43.0            TRUE :150          
##  Self-employed: 29     Mean   : 73.1                               
##  Other        :  8     3rd Qu.:102.5                               
##  (Other)      : 16     Max.   :491.0                               
##  NA's         : 31     NA's   :110                                 
##  CurrentlyInGroup                    GroupKey   DateCreditPulled    
##  Mode :logical    6A3B336601725506917317E: 15   Min.   :2006-02-16  
##  FALSE:351        3D4D3366260257624AB272D: 12   1st Qu.:2007-02-26  
##  TRUE :130        783C3371218786870A73D20: 10   Median :2008-07-12  
##                   FEF83377364176536637E50:  9   Mean   :2009-05-22  
##                   F3BE336490588367617A2BA:  7   3rd Qu.:2011-12-20  
##                   (Other)                : 84   Max.   :2013-09-26  
##                   NA's                   :344                       
##  CreditScoreRangeLower CreditScoreRangeUpper FirstRecordedCreditLine
##  Min.   :  0.0         Min.   : 19.0         Min.   :1964-01-01     
##  1st Qu.:540.0         1st Qu.:559.0         1st Qu.:1992-03-10     
##  Median :640.0         Median :659.0         Median :1997-10-24     
##  Mean   :621.3         Mean   :640.3         Mean   :1996-04-03     
##  3rd Qu.:700.0         3rd Qu.:719.0         3rd Qu.:2001-09-10     
##  Max.   :820.0         Max.   :839.0         Max.   :2010-10-20     
##  NA's   :1             NA's   :1             NA's   :3              
##  CurrentCreditLines OpenCreditLines  TotalCreditLinespast7years
##  Min.   : 0.000     Min.   : 0.000   Min.   : 2.00             
##  1st Qu.: 3.000     1st Qu.: 3.000   1st Qu.:11.00             
##  Median : 7.000     Median : 5.000   Median :18.00             
##  Mean   : 7.372     Mean   : 6.404   Mean   :21.31             
##  3rd Qu.:11.000     3rd Qu.: 9.000   3rd Qu.:28.00             
##  Max.   :41.000     Max.   :40.000   Max.   :99.00             
##  NA's   :110        NA's   :110      NA's   :3                 
##  OpenRevolvingAccounts OpenRevolvingMonthlyPayment InquiriesLast6Months
##  Min.   : 0.00         Min.   :   0.0              Min.   : 0.000      
##  1st Qu.: 1.00         1st Qu.:   0.0              1st Qu.: 1.000      
##  Median : 3.00         Median :  65.0              Median : 2.000      
##  Mean   : 4.05         Mean   : 222.1              Mean   : 3.638      
##  3rd Qu.: 6.00         3rd Qu.: 255.0              3rd Qu.: 5.000      
##  Max.   :41.00         Max.   :5467.0              Max.   :53.000      
##                                                    NA's   :3           
##  TotalInquiries   CurrentDelinquencies AmountDelinquent
##  Min.   : 0.000   Min.   : 0.000       Min.   :     0  
##  1st Qu.: 4.000   1st Qu.: 0.000       1st Qu.:     0  
##  Median : 7.000   Median : 0.000       Median :     0  
##  Mean   : 9.344   Mean   : 3.444       Mean   :  3249  
##  3rd Qu.:11.000   3rd Qu.: 4.000       3rd Qu.:   655  
##  Max.   :70.000   Max.   :83.000       Max.   :183396  
##  NA's   :7        NA's   :3            NA's   :111     
##  DelinquenciesLast7Years PublicRecordsLast10Years
##  Min.   : 0.000          Min.   : 0.0000         
##  1st Qu.: 0.000          1st Qu.: 0.0000         
##  Median : 0.000          Median : 0.0000         
##  Mean   : 7.339          Mean   : 0.4561         
##  3rd Qu.: 9.000          3rd Qu.: 1.0000         
##  Max.   :99.000          Max.   :13.0000         
##  NA's   :6               NA's   :3               
##  PublicRecordsLast12Months RevolvingCreditBalance BankcardUtilization
##  Min.   :0.00000           Min.   :     0         Min.   :0.0000     
##  1st Qu.:0.00000           1st Qu.:   255         1st Qu.:0.0100     
##  Median :0.00000           Median :  2549         Median :0.3800     
##  Mean   :0.03235           Mean   : 11643         Mean   :0.4461     
##  3rd Qu.:0.00000           3rd Qu.:  9612         3rd Qu.:0.8100     
##  Max.   :1.00000           Max.   :277236         Max.   :1.9000     
##  NA's   :110               NA's   :110            NA's   :110        
##  AvailableBankcardCredit  TotalTrades    TradesNeverDelinquent.per
##  Min.   :     0.0        Min.   : 1.00   Min.   :0.0000           
##  1st Qu.:    63.5        1st Qu.: 8.00   1st Qu.:0.6600           
##  Median :  1617.0        Median :15.00   Median :0.9000           
##  Mean   :  8076.8        Mean   :17.23   Mean   :0.7956           
##  3rd Qu.:  9352.0        3rd Qu.:23.00   3rd Qu.:1.0000           
##  Max.   :221237.0        Max.   :71.00   Max.   :1.0000           
##  NA's   :110             NA's   :110     NA's   :110              
##  TradesOpenedLast6Months DebtToIncomeRatio         IncomeRange 
##  Min.   : 0.000          Min.   : 0.0100   $25,000-49,999:164  
##  1st Qu.: 0.000          1st Qu.: 0.0800   Not displayed :113  
##  Median : 1.000          Median : 0.1600   $50,000-74,999: 74  
##  Mean   : 1.234          Mean   : 0.2952   $1-24,999     : 47  
##  3rd Qu.: 2.000          3rd Qu.: 0.3000   $75,000-99,999: 42  
##  Max.   :12.000          Max.   :10.0100   $100,000+     : 34  
##  NA's   :110             NA's   :49        (Other)       :  7  
##  IncomeVerifiable StatedMonthlyIncome                    LoanKey   
##  Mode :logical    Min.   :    0       002E362825862155611E637:  1  
##  FALSE:50         1st Qu.: 2250       00CB365406197330833A161:  1  
##  TRUE :431        Median : 3333       01553609568887611BBF798:  1  
##                   Mean   : 4167       02E53381916276403AA12CE:  1  
##                   3rd Qu.: 5083       037C34042225296828F4D0A:  1  
##                   Max.   :25000       03B836476405253355E7A2E:  1  
##                                       (Other)                :475  
##  TotalProsperLoans TotalProsperPaymentsBilled OnTimeProsperPayments
##  Min.   :1.000     Min.   : 3.00              Min.   : 2.00        
##  1st Qu.:1.000     1st Qu.: 6.00              1st Qu.: 6.00        
##  Median :1.000     Median :10.00              Median :10.00        
##  Mean   :1.204     Mean   :14.98              Mean   :13.93        
##  3rd Qu.:1.000     3rd Qu.:20.00              3rd Qu.:17.75        
##  Max.   :3.000     Max.   :54.00              Max.   :54.00        
##  NA's   :437       NA's   :437                NA's   :437          
##  ProsperPaymentsLessThanOneMonthLate ProsperPaymentsOneMonthPlusLate
##  Min.   : 0.0000                     Min.   :0.0000                 
##  1st Qu.: 0.0000                     1st Qu.:0.0000                 
##  Median : 0.0000                     Median :0.0000                 
##  Mean   : 0.9545                     Mean   :0.0909                 
##  3rd Qu.: 1.0000                     3rd Qu.:0.0000                 
##  Max.   :19.0000                     Max.   :3.0000                 
##  NA's   :437                         NA's   :437                    
##  ProsperPrincipalBorrowed ProsperPrincipalOutstanding
##  Min.   : 1000            Min.   :    0.000          
##  1st Qu.: 3000            1st Qu.:    0.008          
##  Median : 4400            Median : 2011.520          
##  Mean   : 6192            Mean   : 3086.748          
##  3rd Qu.: 8250            3rd Qu.: 3615.912          
##  Max.   :15900            Max.   :14287.750          
##  NA's   :437              NA's   :437                
##  ScorexChangeAtTimeOfListing LoanCurrentDaysDelinquent
##  Min.   :-121.00             Min.   :   0.0           
##  1st Qu.: -61.75             1st Qu.: 219.0           
##  Median : -25.50             Median : 516.0           
##  Mean   : -21.64             Mean   : 784.2           
##  3rd Qu.:   3.75             3rd Qu.:1105.0           
##  Max.   : 117.00             Max.   :2599.0           
##  NA's   :437                                          
##  LoanFirstDefaultedCycleNumber LoanMonthsSinceOrigination   LoanNumber    
##  Min.   : 1.000                Min.   : 5.00              Min.   :   125  
##  1st Qu.: 5.000                1st Qu.:27.00              1st Qu.:  8496  
##  Median : 5.000                Median :67.00              Median : 35037  
##  Mean   : 5.263                Mean   :57.33              Mean   : 35819  
##  3rd Qu.: 5.000                3rd Qu.:84.00              3rd Qu.: 58275  
##  Max.   :24.000                Max.   :96.00              Max.   :103467  
##  NA's   :18                                                               
##  LoanOriginalAmount LoanOriginationDate  LoanOriginationQuarter
##  Min.   : 1000      Min.   :2006-03-06   Q4 2006: 39           
##  1st Qu.: 2100      1st Qu.:2007-03-14   Q1 2007: 39           
##  Median : 3500      Median :2008-08-05   Q3 2006: 36           
##  Mean   : 4709      Mean   :2009-06-06   Q2 2007: 32           
##  3rd Qu.: 5000      3rd Qu.:2011-12-30   Q3 2012: 31           
##  Max.   :25000      Max.   :2013-10-01   Q3 2008: 27           
##                                          (Other):277           
##                    MemberKey   MonthlyLoanPayment LP_CustomerPayments
##  00213395573199409CDA304:  1   Min.   :  31.52    Min.   :0          
##  010434177852874428479BC:  1   1st Qu.:  87.49    1st Qu.:0          
##  011D3380183567215AECD54:  1   Median : 135.28    Median :0          
##  01873419418028013A65F9B:  1   Mean   : 184.44    Mean   :0          
##  01963423813444177D162DE:  1   3rd Qu.: 209.53    3rd Qu.:0          
##  01F23424487177149681C65:  1   Max.   :1047.64    Max.   :0          
##  (Other)                :475                                         
##  LP_CustomerPrincipalPayments LP_InterestandFees LP_ServiceFees    
##  Min.   :0                    Min.   :0          Min.   :-92.4700  
##  1st Qu.:0                    1st Qu.:0          1st Qu.:  0.0000  
##  Median :0                    Median :0          Median :  0.0000  
##  Mean   :0                    Mean   :0          Mean   : -0.4692  
##  3rd Qu.:0                    3rd Qu.:0          3rd Qu.:  0.0000  
##  Max.   :0                    Max.   :0          Max.   :  0.0000  
##                                                                    
##  LP_CollectionFees LP_GrossPrincipalLoss LP_NetPrincipalLoss
##  Min.   :-653.67   Min.   :    0         Min.   :    0      
##  1st Qu.:   0.00   1st Qu.: 2000         1st Qu.: 2000      
##  Median :   0.00   Median : 3500         Median : 3300      
##  Mean   : -16.14   Mean   : 4601         Mean   : 4550      
##  3rd Qu.:   0.00   3rd Qu.: 5000         3rd Qu.: 5000      
##  Max.   :   0.00   Max.   :25000         Max.   :25000      
##                                                             
##  LP_NonPrincipalRecoverypayments PercentFunded    Recommendations  
##  Min.   :   0.00                 Min.   :0.7197   Min.   :0.00000  
##  1st Qu.:   0.00                 1st Qu.:1.0000   1st Qu.:0.00000  
##  Median :   0.00                 Median :1.0000   Median :0.00000  
##  Mean   :  64.06                 Mean   :0.9986   Mean   :0.05405  
##  3rd Qu.:   0.00                 3rd Qu.:1.0000   3rd Qu.:0.00000  
##  Max.   :2440.00                 Max.   :1.0000   Max.   :2.00000  
##                                                                    
##  InvestmentFromFriendsCount InvestmentFromFriendsAmount   Investors     
##  Min.   :0.00000            Min.   :   0.00             Min.   :  1.00  
##  1st Qu.:0.00000            1st Qu.:   0.00             1st Qu.: 16.00  
##  Median :0.00000            Median :   0.00             Median : 36.00  
##  Mean   :0.01871            Mean   :  24.49             Mean   : 58.08  
##  3rd Qu.:0.00000            3rd Qu.:   0.00             3rd Qu.: 71.00  
##  Max.   :1.00000            Max.   :7425.00             Max.   :745.00  
## 
summary(filter(data, !is.na(ClosedDate) & LP_CustomerPayments==0 & LoanStatus=="Completed"))
##                    ListingKey ListingNumber    ListingCreationDate 
##  0D113451667173664D2D2EB:1    Min.   :415054   Min.   :2009-04-28  
##  21F63451223082614E3D321:1    1st Qu.:415118   1st Qu.:2009-04-28  
##  27D034509373504094A753E:1    Median :415327   Median :2009-04-29  
##  36C63450215037018088662:1    Mean   :415293   Mean   :2009-04-29  
##  44773450501513236DCFCEA:1    3rd Qu.:415412   3rd Qu.:2009-04-30  
##  532034516025331845B3905:1    Max.   :415577   Max.   :2009-05-02  
##  (Other)                :4                                         
##   CreditGrade      Term                     LoanStatus
##  C      :3    Min.   :36   Completed             :10  
##  B      :3    1st Qu.:36   Cancelled             : 0  
##  D      :2    Median :36   Chargedoff            : 0  
##  A      :1    Mean   :36   Current               : 0  
##  AA     :1    3rd Qu.:36   Defaulted             : 0  
##  NC     :0    Max.   :36   FinalPaymentInProgress: 0  
##  (Other):0                 (Other)               : 0  
##    ClosedDate          BorrowerAPR       BorrowerRate     LenderYield    
##  Min.   :2009-06-15   Min.   :0.09677   Min.   :0.0760   Min.   :0.0660  
##  1st Qu.:2010-11-26   1st Qu.:0.12887   1st Qu.:0.1018   1st Qu.:0.0918  
##  Median :2011-08-19   Median :0.20999   Median :0.1632   Median :0.1532  
##  Mean   :2011-05-26   Mean   :0.22253   Mean   :0.1863   Mean   :0.1763  
##  3rd Qu.:2012-02-24   3rd Qu.:0.27000   3rd Qu.:0.2335   3rd Qu.:0.2235  
##  Max.   :2012-05-13   Max.   :0.39951   Max.   :0.3500   Max.   :0.3400  
##                                                                          
##  EstimatedEffectiveYield EstimatedLoss EstimatedReturn ProsperRating.num
##  Min.   : NA             Min.   : NA   Min.   : NA     Min.   : NA      
##  1st Qu.: NA             1st Qu.: NA   1st Qu.: NA     1st Qu.: NA      
##  Median : NA             Median : NA   Median : NA     Median : NA      
##  Mean   :NaN             Mean   :NaN   Mean   :NaN     Mean   :NaN      
##  3rd Qu.: NA             3rd Qu.: NA   3rd Qu.: NA     3rd Qu.: NA      
##  Max.   : NA             Max.   : NA   Max.   : NA     Max.   : NA      
##  NA's   :10              NA's   :10    NA's   :10      NA's   :10       
##  ProsperRating.alpha  ProsperScore ListingCategory.num BorrowerState
##  NC     : 0          Min.   : NA   1      :4           GA     :3    
##  HR     : 0          1st Qu.: NA   5      :2           IL     :1    
##  E      : 0          Median : NA   7      :2           MD     :1    
##  D      : 0          Mean   :NaN   3      :1           MN     :1    
##  C      : 0          3rd Qu.: NA   6      :1           NJ     :1    
##  (Other): 0          Max.   : NA   0      :0           OH     :1    
##  NA's   :10          NA's   :10    (Other):0           (Other):2    
##                    Occupation      EmploymentStatus
##  Clerical               :1    Full-time    :9      
##  Computer Programmer    :1    Retired      :1      
##  Engineer - Electrical  :1    Employed     :0      
##  Food Service Management:1    Not available:0      
##  Military Enlisted      :1    Not employed :0      
##  Other                  :1    Other        :0      
##  (Other)                :4    (Other)      :0      
##  EmploymentStatusDuration IsBorrowerHomeowner CurrentlyInGroup
##  Min.   :  7.00           Mode :logical       Mode :logical   
##  1st Qu.: 29.00           FALSE:5             FALSE:9         
##  Median : 65.00           TRUE :5             TRUE :1         
##  Mean   : 74.60                                               
##  3rd Qu.: 91.75                                               
##  Max.   :201.00                                               
##                                                               
##                     GroupKey DateCreditPulled     CreditScoreRangeLower
##  FEF83377364176536637E50:1   Min.   :2009-04-28   Min.   :620          
##  00343376901312423168731:0   1st Qu.:2009-04-28   1st Qu.:640          
##  00943382969547936B0C529:0   Median :2009-04-28   Median :670          
##  00AE3392027644405556335:0   Mean   :2009-04-28   Mean   :682          
##  016833805323396548B2370:0   3rd Qu.:2009-04-28   3rd Qu.:700          
##  (Other)                :0   Max.   :2009-04-30   Max.   :820          
##  NA's                   :9                                             
##  CreditScoreRangeUpper FirstRecordedCreditLine CurrentCreditLines
##  Min.   :639           Min.   :1976-02-25      Min.   : 3.00     
##  1st Qu.:659           1st Qu.:1989-04-08      1st Qu.: 6.25     
##  Median :689           Median :1994-01-04      Median :10.00     
##  Mean   :701           Mean   :1994-02-22      Mean   :10.80     
##  3rd Qu.:719           3rd Qu.:2001-10-18      3rd Qu.:14.25     
##  Max.   :839           Max.   :2007-05-17      Max.   :21.00     
##                                                                  
##  OpenCreditLines TotalCreditLinespast7years OpenRevolvingAccounts
##  Min.   : 3.00   Min.   : 7.0               Min.   :2.00         
##  1st Qu.: 6.25   1st Qu.:15.0               1st Qu.:4.25         
##  Median : 8.50   Median :21.5               Median :6.50         
##  Mean   : 8.70   Mean   :30.4               Mean   :6.00         
##  3rd Qu.:11.25   3rd Qu.:39.0               3rd Qu.:7.75         
##  Max.   :14.00   Max.   :78.0               Max.   :9.00         
##                                                                  
##  OpenRevolvingMonthlyPayment InquiriesLast6Months TotalInquiries 
##  Min.   : 15.0               Min.   :0.0          Min.   : 1.00  
##  1st Qu.: 98.5               1st Qu.:0.0          1st Qu.: 3.25  
##  Median :171.0               Median :0.0          Median : 5.00  
##  Mean   :223.3               Mean   :1.3          Mean   : 6.70  
##  3rd Qu.:269.2               3rd Qu.:1.0          3rd Qu.: 7.25  
##  Max.   :840.0               Max.   :8.0          Max.   :23.00  
##                                                                  
##  CurrentDelinquencies AmountDelinquent DelinquenciesLast7Years
##  Min.   :0            Min.   :0        Min.   :0.0            
##  1st Qu.:0            1st Qu.:0        1st Qu.:0.0            
##  Median :0            Median :0        Median :0.0            
##  Mean   :0            Mean   :0        Mean   :0.9            
##  3rd Qu.:0            3rd Qu.:0        3rd Qu.:0.0            
##  Max.   :0            Max.   :0        Max.   :5.0            
##                                                               
##  PublicRecordsLast10Years PublicRecordsLast12Months RevolvingCreditBalance
##  Min.   :0.0              Min.   :0                 Min.   :  498         
##  1st Qu.:0.0              1st Qu.:0                 1st Qu.: 1867         
##  Median :0.0              Median :0                 Median : 4888         
##  Mean   :0.1              Mean   :0                 Mean   :10410         
##  3rd Qu.:0.0              3rd Qu.:0                 3rd Qu.:12075         
##  Max.   :1.0              Max.   :0                 Max.   :35522         
##                                                                           
##  BankcardUtilization AvailableBankcardCredit  TotalTrades  
##  Min.   :0.0500      Min.   :  129           Min.   : 3.0  
##  1st Qu.:0.3000      1st Qu.:  607           1st Qu.:13.0  
##  Median :0.4250      Median : 4952           Median :18.5  
##  Mean   :0.5240      Mean   :10860           Mean   :25.7  
##  3rd Qu.:0.8475      3rd Qu.:19643           3rd Qu.:37.0  
##  Max.   :0.9900      Max.   :33730           Max.   :68.0  
##                                                            
##  TradesNeverDelinquent.per TradesOpenedLast6Months DebtToIncomeRatio
##  Min.   :0.420             Min.   :0.0             Min.   :0.1000   
##  1st Qu.:0.890             1st Qu.:0.0             1st Qu.:0.1225   
##  Median :0.960             Median :1.0             Median :0.1650   
##  Mean   :0.896             Mean   :1.4             Mean   :0.2700   
##  3rd Qu.:1.000             3rd Qu.:2.0             3rd Qu.:0.2175   
##  Max.   :1.000             Max.   :6.0             Max.   :1.2700   
##                                                                     
##          IncomeRange IncomeVerifiable StatedMonthlyIncome
##  $25,000-49,999:5    Mode:logical     Min.   :   66.67   
##  $50,000-74,999:2    TRUE:10          1st Qu.: 3147.50   
##  $100,000+     :2                     Median : 3817.25   
##  $1-24,999     :1                     Mean   : 4753.66   
##  Not displayed :0                     3rd Qu.: 5750.00   
##  Not employed  :0                     Max.   :10885.42   
##  (Other)       :0                                        
##                     LoanKey  TotalProsperLoans TotalProsperPaymentsBilled
##  04B13555695010627F64371:1   Min.   :1.000     Min.   : 6.00             
##  07D135567368272216AB044:1   1st Qu.:1.000     1st Qu.:10.00             
##  158335561462412446E1A4D:1   Median :1.000     Median :11.00             
##  24973555335786563CA1C8D:1   Mean   :1.444     Mean   :13.67             
##  98F5355630240037783B9E6:1   3rd Qu.:2.000     3rd Qu.:13.00             
##  9B4B355449621339172BFA4:1   Max.   :3.000     Max.   :33.00             
##  (Other)                :4   NA's   :1         NA's   :1                 
##  OnTimeProsperPayments ProsperPaymentsLessThanOneMonthLate
##  Min.   : 6.00         Min.   :0                          
##  1st Qu.:10.00         1st Qu.:0                          
##  Median :11.00         Median :0                          
##  Mean   :13.67         Mean   :0                          
##  3rd Qu.:13.00         3rd Qu.:0                          
##  Max.   :33.00         Max.   :0                          
##  NA's   :1             NA's   :1                          
##  ProsperPaymentsOneMonthPlusLate ProsperPrincipalBorrowed
##  Min.   :0                       Min.   : 1000           
##  1st Qu.:0                       1st Qu.: 3000           
##  Median :0                       Median : 5102           
##  Mean   :0                       Mean   : 6178           
##  3rd Qu.:0                       3rd Qu.: 9000           
##  Max.   :0                       Max.   :15500           
##  NA's   :1                       NA's   :1               
##  ProsperPrincipalOutstanding ScorexChangeAtTimeOfListing
##  Min.   :   0.0              Min.   :-57.000            
##  1st Qu.:   0.0              1st Qu.:-22.000            
##  Median : 481.6              Median : -2.000            
##  Mean   :1243.7              Mean   :  7.222            
##  3rd Qu.:2260.8              3rd Qu.: 21.000            
##  Max.   :3987.8              Max.   :107.000            
##  NA's   :1                   NA's   :1                  
##  LoanCurrentDaysDelinquent LoanFirstDefaultedCycleNumber
##  Min.   :0                 Min.   : NA                  
##  1st Qu.:0                 1st Qu.: NA                  
##  Median :0                 Median : NA                  
##  Mean   :0                 Mean   :NaN                  
##  3rd Qu.:0                 3rd Qu.: NA                  
##  Max.   :0                 Max.   : NA                  
##                            NA's   :10                   
##  LoanMonthsSinceOrigination   LoanNumber    LoanOriginalAmount
##  Min.   :58                 Min.   :38031   Min.   :1000      
##  1st Qu.:58                 1st Qu.:38033   1st Qu.:1125      
##  Median :58                 Median :38036   Median :1850      
##  Mean   :58                 Mean   :38036   Mean   :2270      
##  3rd Qu.:58                 3rd Qu.:38038   3rd Qu.:3000      
##  Max.   :58                 Max.   :38044   Max.   :5000      
##                                                               
##  LoanOriginationDate  LoanOriginationQuarter                   MemberKey
##  Min.   :2009-05-06   Q2 2009:10             01873419418028013A65F9B:1  
##  1st Qu.:2009-05-07   Q1 2006: 0             0588342364795854665007E:1  
##  Median :2009-05-12   Q2 2006: 0             08B9341462500905990325D:1  
##  Mean   :2009-05-10   Q3 2006: 0             1177340984660368892073C:1  
##  3rd Qu.:2009-05-13   Q4 2006: 0             43D93390371566774874F59:1  
##  Max.   :2009-05-14   Q1 2007: 0             63CA34120866140639431C9:1  
##                       (Other): 0             (Other)                :4  
##  MonthlyLoanPayment LP_CustomerPayments LP_CustomerPrincipalPayments
##  Min.   : 35.40     Min.   :0           Min.   :0                   
##  1st Qu.: 46.57     1st Qu.:0           1st Qu.:0                   
##  Median : 60.97     Median :0           Median :0                   
##  Mean   : 81.66     Mean   :0           Mean   :0                   
##  3rd Qu.:108.95     3rd Qu.:0           3rd Qu.:0                   
##  Max.   :158.33     Max.   :0           Max.   :0                   
##                                                                     
##  LP_InterestandFees LP_ServiceFees LP_CollectionFees LP_GrossPrincipalLoss
##  Min.   :0          Min.   :0      Min.   :0         Min.   :0            
##  1st Qu.:0          1st Qu.:0      1st Qu.:0         1st Qu.:0            
##  Median :0          Median :0      Median :0         Median :0            
##  Mean   :0          Mean   :0      Mean   :0         Mean   :0            
##  3rd Qu.:0          3rd Qu.:0      3rd Qu.:0         3rd Qu.:0            
##  Max.   :0          Max.   :0      Max.   :0         Max.   :0            
##                                                                           
##  LP_NetPrincipalLoss LP_NonPrincipalRecoverypayments PercentFunded
##  Min.   :0           Min.   :0                       Min.   :1    
##  1st Qu.:0           1st Qu.:0                       1st Qu.:1    
##  Median :0           Median :0                       Median :1    
##  Mean   :0           Mean   :0                       Mean   :1    
##  3rd Qu.:0           3rd Qu.:0                       3rd Qu.:1    
##  Max.   :0           Max.   :0                       Max.   :1    
##                                                                   
##  Recommendations InvestmentFromFriendsCount InvestmentFromFriendsAmount
##  Min.   :0.0     Min.   :0                  Min.   :0                  
##  1st Qu.:0.0     1st Qu.:0                  1st Qu.:0                  
##  Median :0.0     Median :0                  Median :0                  
##  Mean   :0.2     Mean   :0                  Mean   :0                  
##  3rd Qu.:0.0     3rd Qu.:0                  3rd Qu.:0                  
##  Max.   :1.0     Max.   :0                  Max.   :0                  
##                                                                        
##    Investors    
##  Min.   :15.00  
##  1st Qu.:21.75  
##  Median :42.00  
##  Mean   :39.90  
##  3rd Qu.:48.50  
##  Max.   :79.00  
## 
summary(filter(data, LoanStatus=="Completed" & LP_CustomerPayments==0))
##                    ListingKey ListingNumber    ListingCreationDate 
##  0D113451667173664D2D2EB:1    Min.   :415054   Min.   :2009-04-28  
##  21F63451223082614E3D321:1    1st Qu.:415118   1st Qu.:2009-04-28  
##  27D034509373504094A753E:1    Median :415327   Median :2009-04-29  
##  36C63450215037018088662:1    Mean   :415293   Mean   :2009-04-29  
##  44773450501513236DCFCEA:1    3rd Qu.:415412   3rd Qu.:2009-04-30  
##  532034516025331845B3905:1    Max.   :415577   Max.   :2009-05-02  
##  (Other)                :4                                         
##   CreditGrade      Term                     LoanStatus
##  C      :3    Min.   :36   Completed             :10  
##  B      :3    1st Qu.:36   Cancelled             : 0  
##  D      :2    Median :36   Chargedoff            : 0  
##  A      :1    Mean   :36   Current               : 0  
##  AA     :1    3rd Qu.:36   Defaulted             : 0  
##  NC     :0    Max.   :36   FinalPaymentInProgress: 0  
##  (Other):0                 (Other)               : 0  
##    ClosedDate          BorrowerAPR       BorrowerRate     LenderYield    
##  Min.   :2009-06-15   Min.   :0.09677   Min.   :0.0760   Min.   :0.0660  
##  1st Qu.:2010-11-26   1st Qu.:0.12887   1st Qu.:0.1018   1st Qu.:0.0918  
##  Median :2011-08-19   Median :0.20999   Median :0.1632   Median :0.1532  
##  Mean   :2011-05-26   Mean   :0.22253   Mean   :0.1863   Mean   :0.1763  
##  3rd Qu.:2012-02-24   3rd Qu.:0.27000   3rd Qu.:0.2335   3rd Qu.:0.2235  
##  Max.   :2012-05-13   Max.   :0.39951   Max.   :0.3500   Max.   :0.3400  
##                                                                          
##  EstimatedEffectiveYield EstimatedLoss EstimatedReturn ProsperRating.num
##  Min.   : NA             Min.   : NA   Min.   : NA     Min.   : NA      
##  1st Qu.: NA             1st Qu.: NA   1st Qu.: NA     1st Qu.: NA      
##  Median : NA             Median : NA   Median : NA     Median : NA      
##  Mean   :NaN             Mean   :NaN   Mean   :NaN     Mean   :NaN      
##  3rd Qu.: NA             3rd Qu.: NA   3rd Qu.: NA     3rd Qu.: NA      
##  Max.   : NA             Max.   : NA   Max.   : NA     Max.   : NA      
##  NA's   :10              NA's   :10    NA's   :10      NA's   :10       
##  ProsperRating.alpha  ProsperScore ListingCategory.num BorrowerState
##  NC     : 0          Min.   : NA   1      :4           GA     :3    
##  HR     : 0          1st Qu.: NA   5      :2           IL     :1    
##  E      : 0          Median : NA   7      :2           MD     :1    
##  D      : 0          Mean   :NaN   3      :1           MN     :1    
##  C      : 0          3rd Qu.: NA   6      :1           NJ     :1    
##  (Other): 0          Max.   : NA   0      :0           OH     :1    
##  NA's   :10          NA's   :10    (Other):0           (Other):2    
##                    Occupation      EmploymentStatus
##  Clerical               :1    Full-time    :9      
##  Computer Programmer    :1    Retired      :1      
##  Engineer - Electrical  :1    Employed     :0      
##  Food Service Management:1    Not available:0      
##  Military Enlisted      :1    Not employed :0      
##  Other                  :1    Other        :0      
##  (Other)                :4    (Other)      :0      
##  EmploymentStatusDuration IsBorrowerHomeowner CurrentlyInGroup
##  Min.   :  7.00           Mode :logical       Mode :logical   
##  1st Qu.: 29.00           FALSE:5             FALSE:9         
##  Median : 65.00           TRUE :5             TRUE :1         
##  Mean   : 74.60                                               
##  3rd Qu.: 91.75                                               
##  Max.   :201.00                                               
##                                                               
##                     GroupKey DateCreditPulled     CreditScoreRangeLower
##  FEF83377364176536637E50:1   Min.   :2009-04-28   Min.   :620          
##  00343376901312423168731:0   1st Qu.:2009-04-28   1st Qu.:640          
##  00943382969547936B0C529:0   Median :2009-04-28   Median :670          
##  00AE3392027644405556335:0   Mean   :2009-04-28   Mean   :682          
##  016833805323396548B2370:0   3rd Qu.:2009-04-28   3rd Qu.:700          
##  (Other)                :0   Max.   :2009-04-30   Max.   :820          
##  NA's                   :9                                             
##  CreditScoreRangeUpper FirstRecordedCreditLine CurrentCreditLines
##  Min.   :639           Min.   :1976-02-25      Min.   : 3.00     
##  1st Qu.:659           1st Qu.:1989-04-08      1st Qu.: 6.25     
##  Median :689           Median :1994-01-04      Median :10.00     
##  Mean   :701           Mean   :1994-02-22      Mean   :10.80     
##  3rd Qu.:719           3rd Qu.:2001-10-18      3rd Qu.:14.25     
##  Max.   :839           Max.   :2007-05-17      Max.   :21.00     
##                                                                  
##  OpenCreditLines TotalCreditLinespast7years OpenRevolvingAccounts
##  Min.   : 3.00   Min.   : 7.0               Min.   :2.00         
##  1st Qu.: 6.25   1st Qu.:15.0               1st Qu.:4.25         
##  Median : 8.50   Median :21.5               Median :6.50         
##  Mean   : 8.70   Mean   :30.4               Mean   :6.00         
##  3rd Qu.:11.25   3rd Qu.:39.0               3rd Qu.:7.75         
##  Max.   :14.00   Max.   :78.0               Max.   :9.00         
##                                                                  
##  OpenRevolvingMonthlyPayment InquiriesLast6Months TotalInquiries 
##  Min.   : 15.0               Min.   :0.0          Min.   : 1.00  
##  1st Qu.: 98.5               1st Qu.:0.0          1st Qu.: 3.25  
##  Median :171.0               Median :0.0          Median : 5.00  
##  Mean   :223.3               Mean   :1.3          Mean   : 6.70  
##  3rd Qu.:269.2               3rd Qu.:1.0          3rd Qu.: 7.25  
##  Max.   :840.0               Max.   :8.0          Max.   :23.00  
##                                                                  
##  CurrentDelinquencies AmountDelinquent DelinquenciesLast7Years
##  Min.   :0            Min.   :0        Min.   :0.0            
##  1st Qu.:0            1st Qu.:0        1st Qu.:0.0            
##  Median :0            Median :0        Median :0.0            
##  Mean   :0            Mean   :0        Mean   :0.9            
##  3rd Qu.:0            3rd Qu.:0        3rd Qu.:0.0            
##  Max.   :0            Max.   :0        Max.   :5.0            
##                                                               
##  PublicRecordsLast10Years PublicRecordsLast12Months RevolvingCreditBalance
##  Min.   :0.0              Min.   :0                 Min.   :  498         
##  1st Qu.:0.0              1st Qu.:0                 1st Qu.: 1867         
##  Median :0.0              Median :0                 Median : 4888         
##  Mean   :0.1              Mean   :0                 Mean   :10410         
##  3rd Qu.:0.0              3rd Qu.:0                 3rd Qu.:12075         
##  Max.   :1.0              Max.   :0                 Max.   :35522         
##                                                                           
##  BankcardUtilization AvailableBankcardCredit  TotalTrades  
##  Min.   :0.0500      Min.   :  129           Min.   : 3.0  
##  1st Qu.:0.3000      1st Qu.:  607           1st Qu.:13.0  
##  Median :0.4250      Median : 4952           Median :18.5  
##  Mean   :0.5240      Mean   :10860           Mean   :25.7  
##  3rd Qu.:0.8475      3rd Qu.:19643           3rd Qu.:37.0  
##  Max.   :0.9900      Max.   :33730           Max.   :68.0  
##                                                            
##  TradesNeverDelinquent.per TradesOpenedLast6Months DebtToIncomeRatio
##  Min.   :0.420             Min.   :0.0             Min.   :0.1000   
##  1st Qu.:0.890             1st Qu.:0.0             1st Qu.:0.1225   
##  Median :0.960             Median :1.0             Median :0.1650   
##  Mean   :0.896             Mean   :1.4             Mean   :0.2700   
##  3rd Qu.:1.000             3rd Qu.:2.0             3rd Qu.:0.2175   
##  Max.   :1.000             Max.   :6.0             Max.   :1.2700   
##                                                                     
##          IncomeRange IncomeVerifiable StatedMonthlyIncome
##  $25,000-49,999:5    Mode:logical     Min.   :   66.67   
##  $50,000-74,999:2    TRUE:10          1st Qu.: 3147.50   
##  $100,000+     :2                     Median : 3817.25   
##  $1-24,999     :1                     Mean   : 4753.66   
##  Not displayed :0                     3rd Qu.: 5750.00   
##  Not employed  :0                     Max.   :10885.42   
##  (Other)       :0                                        
##                     LoanKey  TotalProsperLoans TotalProsperPaymentsBilled
##  04B13555695010627F64371:1   Min.   :1.000     Min.   : 6.00             
##  07D135567368272216AB044:1   1st Qu.:1.000     1st Qu.:10.00             
##  158335561462412446E1A4D:1   Median :1.000     Median :11.00             
##  24973555335786563CA1C8D:1   Mean   :1.444     Mean   :13.67             
##  98F5355630240037783B9E6:1   3rd Qu.:2.000     3rd Qu.:13.00             
##  9B4B355449621339172BFA4:1   Max.   :3.000     Max.   :33.00             
##  (Other)                :4   NA's   :1         NA's   :1                 
##  OnTimeProsperPayments ProsperPaymentsLessThanOneMonthLate
##  Min.   : 6.00         Min.   :0                          
##  1st Qu.:10.00         1st Qu.:0                          
##  Median :11.00         Median :0                          
##  Mean   :13.67         Mean   :0                          
##  3rd Qu.:13.00         3rd Qu.:0                          
##  Max.   :33.00         Max.   :0                          
##  NA's   :1             NA's   :1                          
##  ProsperPaymentsOneMonthPlusLate ProsperPrincipalBorrowed
##  Min.   :0                       Min.   : 1000           
##  1st Qu.:0                       1st Qu.: 3000           
##  Median :0                       Median : 5102           
##  Mean   :0                       Mean   : 6178           
##  3rd Qu.:0                       3rd Qu.: 9000           
##  Max.   :0                       Max.   :15500           
##  NA's   :1                       NA's   :1               
##  ProsperPrincipalOutstanding ScorexChangeAtTimeOfListing
##  Min.   :   0.0              Min.   :-57.000            
##  1st Qu.:   0.0              1st Qu.:-22.000            
##  Median : 481.6              Median : -2.000            
##  Mean   :1243.7              Mean   :  7.222            
##  3rd Qu.:2260.8              3rd Qu.: 21.000            
##  Max.   :3987.8              Max.   :107.000            
##  NA's   :1                   NA's   :1                  
##  LoanCurrentDaysDelinquent LoanFirstDefaultedCycleNumber
##  Min.   :0                 Min.   : NA                  
##  1st Qu.:0                 1st Qu.: NA                  
##  Median :0                 Median : NA                  
##  Mean   :0                 Mean   :NaN                  
##  3rd Qu.:0                 3rd Qu.: NA                  
##  Max.   :0                 Max.   : NA                  
##                            NA's   :10                   
##  LoanMonthsSinceOrigination   LoanNumber    LoanOriginalAmount
##  Min.   :58                 Min.   :38031   Min.   :1000      
##  1st Qu.:58                 1st Qu.:38033   1st Qu.:1125      
##  Median :58                 Median :38036   Median :1850      
##  Mean   :58                 Mean   :38036   Mean   :2270      
##  3rd Qu.:58                 3rd Qu.:38038   3rd Qu.:3000      
##  Max.   :58                 Max.   :38044   Max.   :5000      
##                                                               
##  LoanOriginationDate  LoanOriginationQuarter                   MemberKey
##  Min.   :2009-05-06   Q2 2009:10             01873419418028013A65F9B:1  
##  1st Qu.:2009-05-07   Q1 2006: 0             0588342364795854665007E:1  
##  Median :2009-05-12   Q2 2006: 0             08B9341462500905990325D:1  
##  Mean   :2009-05-10   Q3 2006: 0             1177340984660368892073C:1  
##  3rd Qu.:2009-05-13   Q4 2006: 0             43D93390371566774874F59:1  
##  Max.   :2009-05-14   Q1 2007: 0             63CA34120866140639431C9:1  
##                       (Other): 0             (Other)                :4  
##  MonthlyLoanPayment LP_CustomerPayments LP_CustomerPrincipalPayments
##  Min.   : 35.40     Min.   :0           Min.   :0                   
##  1st Qu.: 46.57     1st Qu.:0           1st Qu.:0                   
##  Median : 60.97     Median :0           Median :0                   
##  Mean   : 81.66     Mean   :0           Mean   :0                   
##  3rd Qu.:108.95     3rd Qu.:0           3rd Qu.:0                   
##  Max.   :158.33     Max.   :0           Max.   :0                   
##                                                                     
##  LP_InterestandFees LP_ServiceFees LP_CollectionFees LP_GrossPrincipalLoss
##  Min.   :0          Min.   :0      Min.   :0         Min.   :0            
##  1st Qu.:0          1st Qu.:0      1st Qu.:0         1st Qu.:0            
##  Median :0          Median :0      Median :0         Median :0            
##  Mean   :0          Mean   :0      Mean   :0         Mean   :0            
##  3rd Qu.:0          3rd Qu.:0      3rd Qu.:0         3rd Qu.:0            
##  Max.   :0          Max.   :0      Max.   :0         Max.   :0            
##                                                                           
##  LP_NetPrincipalLoss LP_NonPrincipalRecoverypayments PercentFunded
##  Min.   :0           Min.   :0                       Min.   :1    
##  1st Qu.:0           1st Qu.:0                       1st Qu.:1    
##  Median :0           Median :0                       Median :1    
##  Mean   :0           Mean   :0                       Mean   :1    
##  3rd Qu.:0           3rd Qu.:0                       3rd Qu.:1    
##  Max.   :0           Max.   :0                       Max.   :1    
##                                                                   
##  Recommendations InvestmentFromFriendsCount InvestmentFromFriendsAmount
##  Min.   :0.0     Min.   :0                  Min.   :0                  
##  1st Qu.:0.0     1st Qu.:0                  1st Qu.:0                  
##  Median :0.0     Median :0                  Median :0                  
##  Mean   :0.2     Mean   :0                  Mean   :0                  
##  3rd Qu.:0.0     3rd Qu.:0                  3rd Qu.:0                  
##  Max.   :1.0     Max.   :0                  Max.   :0                  
##                                                                        
##    Investors    
##  Min.   :15.00  
##  1st Qu.:21.75  
##  Median :42.00  
##  Mean   :39.90  
##  3rd Qu.:48.50  
##  Max.   :79.00  
## 

The one thing I notice is that the majority of these loans originated in 2014. Most, but not all, are still open. Other LP values for most of them are 0. Of those that are closed, most were either charged off, defaulted, or cancelled - only 10 were completed. Of those that were completed, all are from the same loan origination quarter, have the same number of months since the loan originated, and are assigned loan numbers adjacent to each other. All LP values are 0.

If the loan has been completed, then with the exception of those 10 records, all customers have made payments.

This leads me to strongly suspect that for this group of loans, the data is simply missing, perhaps through system error. I therefore tentatively conclude that for this measure the value is 0 only for those borrowers whole loans are still open, or whole loans were charged off, defaulted, or cancelled.

summary(filter(data, LP_CollectionFees!=0))
##                    ListingKey   ListingNumber     ListingCreationDate 
##  0CDD3589734051739A10B58:   2   Min.   :     28   Min.   :2005-11-21  
##  426B3588416323222A031B6:   2   1st Qu.: 188371   1st Qu.:2007-08-16  
##  87EE35921635083297DEB55:   2   Median : 404988   Median :2008-09-26  
##  0005353671687550573289D:   1   Mean   : 393687   Mean   :2009-10-30  
##  000D348547019249114C31E:   1   3rd Qu.: 564927   3rd Qu.:2012-03-03  
##  001035373445372274F74E2:   1   Max.   :1117488   Max.   :2014-01-09  
##  (Other)                :8157                                         
##   CreditGrade        Term                       LoanStatus  
##  D      : 836   Min.   :12.00   Chargedoff           :3510  
##  C      : 802   1st Qu.:36.00   Defaulted            :1381  
##  HR     : 721   Median :36.00   Completed            :1339  
##  E      : 636   Mean   :38.32   Current              :1034  
##  B      : 577   3rd Qu.:36.00   Past Due (1-15 days) : 379  
##  (Other): 546   Max.   :60.00   Past Due (31-60 days): 149  
##  NA's   :4048                   (Other)              : 374  
##    ClosedDate          BorrowerAPR       BorrowerRate     LenderYield     
##  Min.   :2006-09-22   Min.   :0.01315   Min.   :0.0000   Min.   :-0.0100  
##  1st Qu.:2009-04-04   1st Qu.:0.19212   1st Qu.:0.1750   1st Qu.: 0.1650  
##  Median :2010-05-29   Median :0.25757   Median :0.2375   Median : 0.2250  
##  Mean   :2010-10-20   Mean   :0.25496   Mean   :0.2317   Mean   : 0.2211  
##  3rd Qu.:2012-09-11   3rd Qu.:0.31375   3rd Qu.:0.2900   3rd Qu.: 0.2809  
##  Max.   :2014-03-10   Max.   :0.45857   Max.   :0.4500   Max.   : 0.4325  
##  NA's   :1936         NA's   :1                                           
##  EstimatedEffectiveYield EstimatedLoss   EstimatedReturn 
##  Min.   :-0.126          Min.   :0.006   Min.   :-0.126  
##  1st Qu.: 0.164          1st Qu.:0.072   1st Qu.: 0.105  
##  Median : 0.231          Median :0.108   Median : 0.124  
##  Mean   : 0.215          Mean   :0.110   Mean   : 0.120  
##  3rd Qu.: 0.280          3rd Qu.:0.147   3rd Qu.: 0.141  
##  Max.   : 0.320          Max.   :0.366   Max.   : 0.284  
##  NA's   :4118            NA's   :4118    NA's   :4118    
##  ProsperRating.num ProsperRating.alpha  ProsperScore   
##  Min.   :1.000     D      :1040        Min.   : 1.000  
##  1st Qu.:2.000     E      : 806        1st Qu.: 4.000  
##  Median :3.000     HR     : 791        Median : 5.000  
##  Mean   :3.031     C      : 665        Mean   : 5.342  
##  3rd Qu.:4.000     B      : 439        3rd Qu.: 7.000  
##  Max.   :7.000     (Other): 307        Max.   :11.000  
##  NA's   :4118      NA's   :4118        NA's   :4118    
##  ListingCategory.num BorrowerState                     Occupation  
##  1      :2788        CA     :1008   Other                   :2176  
##  0      :2485        IL     : 467   Professional            : 926  
##  7      : 773        GA     : 453   Administrative Assistant: 357  
##  3      : 606        NY     : 443   Teacher                 : 339  
##  2      : 475        FL     : 429   Sales - Commission      : 317  
##  4      : 321        (Other):4634   (Other)                 :3757  
##  (Other): 718        NA's   : 732   NA's                    : 294  
##       EmploymentStatus EmploymentStatusDuration IsBorrowerHomeowner
##  Full-time    :3200    Min.   :  0.00           Mode :logical      
##  Employed     :2937    1st Qu.: 20.00           FALSE:4519         
##  Not available: 753    Median : 53.00           TRUE :3647         
##  Self-employed: 441    Mean   : 84.12                              
##  Other        : 185    3rd Qu.:118.00                              
##  (Other)      : 365    Max.   :732.00                              
##  NA's         : 285    NA's   :1038                                
##  CurrentlyInGroup                    GroupKey    DateCreditPulled    
##  Mode :logical    783C3371218786870A73D20: 197   Min.   :2005-11-21  
##  FALSE:6347       FEF83377364176536637E50: 132   1st Qu.:2007-08-08  
##  TRUE :1819       3D4D3366260257624AB272D: 116   Median :2008-09-21  
##                   6A3B336601725506917317E:  94   Mean   :2009-10-27  
##                   FE113364863511529673D04:  58   3rd Qu.:2012-03-06  
##                   (Other)                :1306   Max.   :2014-01-06  
##                   NA's                   :6263                       
##  CreditScoreRangeLower CreditScoreRangeUpper FirstRecordedCreditLine
##  Min.   :  0.0         Min.   : 19.0         Min.   :1951-07-04     
##  1st Qu.:620.0         1st Qu.:639.0         1st Qu.:1989-12-08     
##  Median :660.0         Median :679.0         Median :1995-06-07     
##  Mean   :651.4         Mean   :670.4         Mean   :1994-06-13     
##  3rd Qu.:700.0         3rd Qu.:719.0         3rd Qu.:1999-09-30     
##  Max.   :880.0         Max.   :899.0         Max.   :2011-08-10     
##  NA's   :62            NA's   :62            NA's   :73             
##  CurrentCreditLines OpenCreditLines  TotalCreditLinespast7years
##  Min.   : 0.000     Min.   : 0.000   Min.   :  2.00            
##  1st Qu.: 5.000     1st Qu.: 4.000   1st Qu.: 15.00            
##  Median : 8.000     Median : 7.000   Median : 24.00            
##  Mean   : 8.959     Mean   : 7.741   Mean   : 25.74            
##  3rd Qu.:12.000     3rd Qu.:10.000   3rd Qu.: 34.00            
##  Max.   :52.000     Max.   :48.000   Max.   :108.00            
##  NA's   :1038       NA's   :1038     NA's   :73                
##  OpenRevolvingAccounts OpenRevolvingMonthlyPayment InquiriesLast6Months
##  Min.   : 0.000        Min.   :    0.0             Min.   :  0.000     
##  1st Qu.: 2.000        1st Qu.:   45.0             1st Qu.:  0.000     
##  Median : 5.000        Median :  151.0             Median :  1.000     
##  Mean   : 5.416        Mean   :  300.5             Mean   :  2.416     
##  3rd Qu.: 8.000        3rd Qu.:  378.0             3rd Qu.:  3.000     
##  Max.   :40.000        Max.   :14985.0             Max.   :105.000     
##                                                    NA's   :73          
##  TotalInquiries    CurrentDelinquencies AmountDelinquent   
##  Min.   :  0.000   Min.   : 0.000       Min.   :     0.00  
##  1st Qu.:  3.000   1st Qu.: 0.000       1st Qu.:     0.00  
##  Median :  5.000   Median : 0.000       Median :     0.00  
##  Mean   :  8.075   Mean   : 1.226       Mean   :  1353.47  
##  3rd Qu.: 10.000   3rd Qu.: 1.000       3rd Qu.:    66.25  
##  Max.   :379.000   Max.   :64.000       Max.   :215315.00  
##  NA's   :131       NA's   :73           NA's   :1038       
##  DelinquenciesLast7Years PublicRecordsLast10Years
##  Min.   : 0.000          Min.   : 0.0000         
##  1st Qu.: 0.000          1st Qu.: 0.0000         
##  Median : 0.000          Median : 0.0000         
##  Mean   : 6.469          Mean   : 0.4211         
##  3rd Qu.: 8.000          3rd Qu.: 1.0000         
##  Max.   :99.000          Max.   :17.0000         
##  NA's   :118             NA's   :73              
##  PublicRecordsLast12Months RevolvingCreditBalance BankcardUtilization
##  Min.   :0.0000            Min.   :     0         Min.   :0.0000     
##  1st Qu.:0.0000            1st Qu.:  1072         1st Qu.:0.2500     
##  Median :0.0000            Median :  4540         Median :0.6400     
##  Mean   :0.0363            Mean   : 13469         Mean   :0.5708     
##  3rd Qu.:0.0000            3rd Qu.: 13615         3rd Qu.:0.8900     
##  Max.   :7.0000            Max.   :493300         Max.   :5.8300     
##  NA's   :1038              NA's   :1038           NA's   :1038       
##  AvailableBankcardCredit  TotalTrades     TradesNeverDelinquent.per
##  Min.   :     0          Min.   :  1.00   Min.   :0.0000           
##  1st Qu.:   192          1st Qu.: 12.00   1st Qu.:0.7000           
##  Median :  1492          Median : 19.00   Median :0.8500           
##  Mean   :  6211          Mean   : 21.46   Mean   :0.8083           
##  3rd Qu.:  6298          3rd Qu.: 29.00   3rd Qu.:0.9700           
##  Max.   :498374          Max.   :102.00   Max.   :1.0000           
##  NA's   :1028            NA's   :1028     NA's   :1028             
##  TradesOpenedLast6Months DebtToIncomeRatio         IncomeRange  
##  Min.   : 0.0000         Min.   : 0.0000   $25,000-49,999:2536  
##  1st Qu.: 0.0000         1st Qu.: 0.1400   $50,000-74,999:1856  
##  Median : 1.0000         Median : 0.2100   Not displayed :1055  
##  Mean   : 0.9891         Mean   : 0.3117   $75,000-99,999: 920  
##  3rd Qu.: 2.0000         3rd Qu.: 0.3200   $100,000+     : 824  
##  Max.   :13.0000         Max.   :10.0100   $1-24,999     : 793  
##  NA's   :1028            NA's   :666       (Other)       : 182  
##  IncomeVerifiable StatedMonthlyIncome                    LoanKey    
##  Mode :logical    Min.   :      0     18FD3697424163510882853:   2  
##  FALSE:673        1st Qu.:   2667     BD5536972515798992B29C9:   2  
##  TRUE :7493       Median :   4083     FAF336950776856532E5CFC:   2  
##                   Mean   :   5068     00023650503696810C531F7:   1  
##                   3rd Qu.:   6000     000B3366346245964D6187E:   1  
##                   Max.   :1750003     001336793077504887041A4:   1  
##                                       (Other)                :8157  
##  TotalProsperLoans TotalProsperPaymentsBilled OnTimeProsperPayments
##  Min.   :1.0       Min.   :  0.00             Min.   :  0.00       
##  1st Qu.:1.0       1st Qu.:  9.00             1st Qu.:  8.00       
##  Median :1.0       Median : 14.00             Median : 13.00       
##  Mean   :1.3       Mean   : 21.41             Mean   : 19.56       
##  3rd Qu.:1.0       3rd Qu.: 32.00             3rd Qu.: 27.00       
##  Max.   :5.0       Max.   :123.00             Max.   :103.00       
##  NA's   :6629      NA's   :6629               NA's   :6629         
##  ProsperPaymentsLessThanOneMonthLate ProsperPaymentsOneMonthPlusLate
##  Min.   : 0.000                      Min.   : 0.00                  
##  1st Qu.: 0.000                      1st Qu.: 0.00                  
##  Median : 0.000                      Median : 0.00                  
##  Mean   : 1.682                      Mean   : 0.17                  
##  3rd Qu.: 1.000                      3rd Qu.: 0.00                  
##  Max.   :42.000                      Max.   :21.00                  
##  NA's   :6629                        NA's   :6629                   
##  ProsperPrincipalBorrowed ProsperPrincipalOutstanding
##  Min.   : 1000            Min.   :    0.00           
##  1st Qu.: 3000            1st Qu.:    0.11           
##  Median : 5000            Median : 2095.15           
##  Mean   : 6648            Mean   : 2971.07           
##  3rd Qu.: 8000            3rd Qu.: 4013.76           
##  Max.   :41000            Max.   :20946.73           
##  NA's   :6629             NA's   :6629               
##  ScorexChangeAtTimeOfListing LoanCurrentDaysDelinquent
##  Min.   :-180.000            Min.   :   0.0           
##  1st Qu.: -40.000            1st Qu.:   0.0           
##  Median :  -6.000            Median : 222.5           
##  Mean   :  -9.108            Mean   : 629.1           
##  3rd Qu.:  17.000            3rd Qu.:1280.8           
##  Max.   : 214.000            Max.   :2408.0           
##  NA's   :6655                                         
##  LoanFirstDefaultedCycleNumber LoanMonthsSinceOrigination   LoanNumber    
##  Min.   : 0.00                 Min.   :  2.00             Min.   :     8  
##  1st Qu.:11.00                 1st Qu.: 24.00             1st Qu.: 18357  
##  Median :16.00                 Median : 65.00             Median : 37433  
##  Mean   :17.83                 Mean   : 52.17             Mean   : 41578  
##  3rd Qu.:25.00                 3rd Qu.: 79.00             3rd Qu.: 61954  
##  Max.   :41.00                 Max.   :100.00             Max.   :126049  
##  NA's   :3237                                                             
##  LoanOriginalAmount LoanOriginationDate  LoanOriginationQuarter
##  Min.   : 1000      Min.   :2005-11-28   Q2 2008: 600          
##  1st Qu.: 3000      1st Qu.:2007-08-27   Q2 2007: 530          
##  Median : 4760      Median :2008-10-06   Q2 2012: 510          
##  Mean   : 6695      Mean   :2009-11-11   Q3 2008: 493          
##  3rd Qu.: 9000      3rd Qu.:2012-03-14   Q3 2012: 493          
##  Max.   :35000      Max.   :2014-01-13   (Other):5539          
##                                          NA's   :   1          
##                    MemberKey    MonthlyLoanPayment LP_CustomerPayments
##  0196338772112490878A367:   3   Min.   :   0.0     Min.   :   -2.35   
##  13F63388456517019BEEF82:   3   1st Qu.: 108.6     1st Qu.: 1450.01   
##  2AFA35685188842990454C4:   3   Median : 173.7     Median : 3000.88   
##  47DB3372200088144B54373:   3   Mean   : 233.4     Mean   : 4427.73   
##  63D833653414495348BC9AA:   3   3rd Qu.: 314.2     3rd Qu.: 5741.77   
##  72DC3382168310312DC5EE1:   3   Max.   :1340.0     Max.   :40547.70   
##  (Other)                :8148                                         
##  LP_CustomerPrincipalPayments LP_InterestandFees LP_ServiceFees   
##  Min.   :    0.0              Min.   :   -2.35   Min.   :-664.87  
##  1st Qu.:  661.7              1st Qu.:  635.39   1st Qu.: -83.36  
##  Median : 1589.1              Median : 1227.30   Median : -42.99  
##  Mean   : 2797.0              Mean   : 1630.69   Mean   : -63.71  
##  3rd Qu.: 3526.4              3rd Qu.: 2111.66   3rd Qu.: -20.33  
##  Max.   :25087.7              Max.   :15547.70   Max.   :  32.06  
##                                                                   
##  LP_CollectionFees  LP_GrossPrincipalLoss LP_NetPrincipalLoss
##  Min.   :-9274.75   Min.   :  -94.2       Min.   : -954.5    
##  1st Qu.: -204.56   1st Qu.:    0.0       1st Qu.:    0.0    
##  Median :  -83.61   Median : 1124.6       Median :  728.7    
##  Mean   : -198.72   Mean   : 2633.5       Mean   : 2395.2    
##  3rd Qu.:  -36.06   3rd Qu.: 3677.9       3rd Qu.: 3304.8    
##  Max.   :   -0.40   Max.   :25000.0       Max.   :25000.0    
##                                                              
##  LP_NonPrincipalRecoverypayments PercentFunded    Recommendations   
##  Min.   :    0.0                 Min.   :0.7013   Min.   : 0.00000  
##  1st Qu.:    0.0                 1st Qu.:1.0000   1st Qu.: 0.00000  
##  Median :    0.0                 Median :1.0000   Median : 0.00000  
##  Mean   :  281.7                 Mean   :0.9982   Mean   : 0.07654  
##  3rd Qu.:  130.3                 3rd Qu.:1.0000   3rd Qu.: 0.00000  
##  Max.   :21117.9                 Max.   :1.0000   Max.   :16.00000  
##                                                                     
##  InvestmentFromFriendsCount InvestmentFromFriendsAmount   Investors     
##  Min.   :0.00000            Min.   :    0.0             Min.   :  1.00  
##  1st Qu.:0.00000            1st Qu.:    0.0             1st Qu.: 25.00  
##  Median :0.00000            Median :    0.0             Median : 57.00  
##  Mean   :0.03355            Mean   :   33.3             Mean   : 91.74  
##  3rd Qu.:0.00000            3rd Qu.:    0.0             3rd Qu.:121.00  
##  Max.   :9.00000            Max.   :15000.0             Max.   :833.00  
## 
summary(filter(data, LP_NetPrincipalLoss!=0))
##                    ListingKey    ListingNumber    ListingCreationDate 
##  00003546482094282EF90E5:    1   Min.   :    99   Min.   :2006-01-25  
##  00013542762124763F20254:    1   1st Qu.:131376   1st Qu.:2007-05-01  
##  000433785890431972B4743:    1   Median :319181   Median :2008-04-27  
##  0005353671687550573289D:    1   Mean   :324530   Mean   :2009-02-18  
##  001035373445372274F74E2:    1   3rd Qu.:511888   3rd Qu.:2011-06-16  
##  00143395229257559A91663:    1   Max.   :932346   Max.   :2013-09-26  
##  (Other)                :16709                                        
##   CreditGrade        Term                        LoanStatus   
##  HR     :2110   Min.   :12.00   Chargedoff            :11982  
##  C      :2009   1st Qu.:36.00   Defaulted             : 4733  
##  D      :2003   Median :36.00   Cancelled             :    0  
##  E      :1591   Mean   :37.69   Completed             :    0  
##  B      :1384   3rd Qu.:36.00   Current               :    0  
##  (Other):1426   Max.   :60.00   FinalPaymentInProgress:    0  
##  NA's   :6192                   (Other)               :    0  
##    ClosedDate          BorrowerAPR       BorrowerRate     LenderYield     
##  Min.   :2006-09-05   Min.   :0.00864   Min.   :0.0000   Min.   :-0.0100  
##  1st Qu.:2008-11-13   1st Qu.:0.18698   1st Qu.:0.1703   1st Qu.: 0.1610  
##  Median :2009-12-19   Median :0.25424   Median :0.2375   Median : 0.2250  
##  Mean   :2010-07-23   Mean   :0.25206   Mean   :0.2317   Mean   : 0.2209  
##  3rd Qu.:2012-10-01   3rd Qu.:0.30781   3rd Qu.:0.2900   3rd Qu.: 0.2809  
##  Max.   :2014-03-10   Max.   :0.50633   Max.   :0.4975   Max.   : 0.4800  
##                                                                           
##  EstimatedEffectiveYield EstimatedLoss   EstimatedReturn 
##  Min.   :-0.182          Min.   :0.006   Min.   :-0.182  
##  1st Qu.: 0.159          1st Qu.:0.087   1st Qu.: 0.111  
##  Median : 0.235          Median :0.112   Median : 0.125  
##  Mean   : 0.217          Mean   :0.116   Mean   : 0.123  
##  3rd Qu.: 0.286          3rd Qu.:0.149   3rd Qu.: 0.144  
##  Max.   : 0.320          Max.   :0.366   Max.   : 0.284  
##  NA's   :10532           NA's   :10532   NA's   :10532   
##  ProsperRating.num ProsperRating.alpha  ProsperScore   ListingCategory.num
##  Min.   :1.000     D      : 1633       Min.   : 1.00   0      :6635       
##  1st Qu.:2.000     HR     : 1384       1st Qu.: 4.00   1      :4588       
##  Median :3.000     E      : 1294       Median : 5.00   7      :1450       
##  Mean   :2.906     C      :  816       Mean   : 5.42   3      :1387       
##  3rd Qu.:4.000     B      :  581       3rd Qu.: 7.00   2      : 792       
##  Max.   :7.000     (Other):  475       Max.   :10.00   4      : 760       
##  NA's   :10532     NA's   :10532       NA's   :10532   (Other):1103       
##  BorrowerState                     Occupation        EmploymentStatus
##  CA     :2269   Other                   :4670   Full-time    :7416   
##  GA     :1017   Professional            :1671   Employed     :4065   
##  IL     :1012   Clerical                : 699   Not available:2252   
##  FL     : 969   Sales - Commission      : 680   Self-employed:1131   
##  TX     : 911   Administrative Assistant: 659   Other        : 342   
##  (Other):8925   (Other)                 :7529   (Other)      : 702   
##  NA's   :1612   NA's                    : 807   NA's         : 807   
##  EmploymentStatusDuration IsBorrowerHomeowner CurrentlyInGroup
##  Min.   :  0.00           Mode :logical       Mode :logical   
##  1st Qu.: 19.00           FALSE:9236          FALSE:12103     
##  Median : 51.00           TRUE :7479          TRUE :4612      
##  Mean   : 80.11                                               
##  3rd Qu.:112.00                                               
##  Max.   :755.00                                               
##  NA's   :3062                                                 
##                     GroupKey     DateCreditPulled    
##  783C3371218786870A73D20:  493   Min.   :2005-12-11  
##  FEF83377364176536637E50:  310   1st Qu.:2007-04-24  
##  3D4D3366260257624AB272D:  293   Median :2008-04-22  
##  6A3B336601725506917317E:  275   Mean   :2009-02-14  
##  FE113364863511529673D04:  180   3rd Qu.:2011-06-16  
##  (Other)                : 3302   Max.   :2013-09-26  
##  NA's                   :11862                       
##  CreditScoreRangeLower CreditScoreRangeUpper FirstRecordedCreditLine
##  Min.   :  0.0         Min.   : 19.0         Min.   :1947-08-24     
##  1st Qu.:600.0         1st Qu.:619.0         1st Qu.:1990-07-12     
##  Median :640.0         Median :659.0         Median :1995-09-01     
##  Mean   :640.5         Mean   :659.5         Mean   :1994-09-26     
##  3rd Qu.:700.0         3rd Qu.:719.0         3rd Qu.:1999-11-27     
##  Max.   :860.0         Max.   :879.0         Max.   :2011-08-10     
##  NA's   :173           NA's   :173           NA's   :232            
##  CurrentCreditLines OpenCreditLines  TotalCreditLinespast7years
##  Min.   : 0.000     Min.   : 0.000   Min.   :  2.00            
##  1st Qu.: 5.000     1st Qu.: 4.000   1st Qu.: 14.00            
##  Median : 8.000     Median : 7.000   Median : 23.00            
##  Mean   : 9.274     Mean   : 8.073   Mean   : 25.03            
##  3rd Qu.:13.000     3rd Qu.:11.000   3rd Qu.: 34.00            
##  Max.   :52.000     Max.   :51.000   Max.   :129.00            
##  NA's   :3060       NA's   :3060     NA's   :232               
##  OpenRevolvingAccounts OpenRevolvingMonthlyPayment InquiriesLast6Months
##  Min.   : 0.000        Min.   :    0.0             Min.   :  0.000     
##  1st Qu.: 2.000        1st Qu.:   37.0             1st Qu.:  0.000     
##  Median : 5.000        Median :  152.0             Median :  2.000     
##  Mean   : 5.595        Mean   :  319.2             Mean   :  3.008     
##  3rd Qu.: 8.000        3rd Qu.:  400.0             3rd Qu.:  4.000     
##  Max.   :51.000        Max.   :14985.0             Max.   :105.000     
##                                                    NA's   :232         
##  TotalInquiries    CurrentDelinquencies AmountDelinquent
##  Min.   :  0.000   Min.   : 0.000       Min.   :     0  
##  1st Qu.:  3.000   1st Qu.: 0.000       1st Qu.:     0  
##  Median :  6.000   Median : 0.000       Median :     0  
##  Mean   :  9.583   Mean   : 1.615       Mean   :  1422  
##  3rd Qu.: 13.000   3rd Qu.: 1.000       3rd Qu.:   106  
##  Max.   :379.000   Max.   :83.000       Max.   :444745  
##  NA's   :365       NA's   :232          NA's   :3062    
##  DelinquenciesLast7Years PublicRecordsLast10Years
##  Min.   : 0.000          Min.   : 0.0000         
##  1st Qu.: 0.000          1st Qu.: 0.0000         
##  Median : 0.000          Median : 0.0000         
##  Mean   : 6.047          Mean   : 0.4478         
##  3rd Qu.: 7.000          3rd Qu.: 1.0000         
##  Max.   :99.000          Max.   :30.0000         
##  NA's   :341             NA's   :232             
##  PublicRecordsLast12Months RevolvingCreditBalance BankcardUtilization
##  Min.   :0.0000            Min.   :     0         Min.   :0.0000     
##  1st Qu.:0.0000            1st Qu.:  1086         1st Qu.:0.2300     
##  Median :0.0000            Median :  5137         Median :0.6300     
##  Mean   :0.0359            Mean   : 15584         Mean   :0.5659     
##  3rd Qu.:0.0000            3rd Qu.: 15578         3rd Qu.:0.8900     
##  Max.   :7.0000            Max.   :600223         Max.   :4.7300     
##  NA's   :3060              NA's   :3060           NA's   :3060       
##  AvailableBankcardCredit  TotalTrades     TradesNeverDelinquent.per
##  Min.   :     0          Min.   :  1.00   Min.   :0.000            
##  1st Qu.:   200          1st Qu.: 12.00   1st Qu.:0.710            
##  Median :  1778          Median : 19.00   Median :0.880            
##  Mean   :  7162          Mean   : 21.18   Mean   :0.819            
##  3rd Qu.:  7678          3rd Qu.: 29.00   3rd Qu.:1.000            
##  Max.   :364284          Max.   :118.00   Max.   :1.000            
##  NA's   :3043            NA's   :3043     NA's   :3043             
##  TradesOpenedLast6Months DebtToIncomeRatio         IncomeRange  
##  Min.   : 0.000          Min.   : 0.0000   $25,000-49,999:5336  
##  1st Qu.: 0.000          1st Qu.: 0.1381   $50,000-74,999:3442  
##  Median : 1.000          Median : 0.2200   Not displayed :3103  
##  Mean   : 1.124          Mean   : 0.3483   $1-24,999     :1634  
##  3rd Qu.: 2.000          3rd Qu.: 0.3300   $75,000-99,999:1494  
##  Max.   :17.000          Max.   :10.0100   $100,000+     :1269  
##  NA's   :3043            NA's   :1474      (Other)       : 437  
##  IncomeVerifiable StatedMonthlyIncome                    LoanKey     
##  Mode :logical    Min.   :     0      00023650503696810C531F7:    1  
##  FALSE:1495       1st Qu.:  2500      0004363753221955965B646:    1  
##  TRUE :15220      Median :  3750      000836579711360490B130B:    1  
##                   Mean   :  4452      000B3366346245964D6187E:    1  
##                   3rd Qu.:  5417      000B3656359179267F91999:    1  
##                   Max.   :208333      00193564075967640E1A9A1:    1  
##                                       (Other)                :16709  
##  TotalProsperLoans TotalProsperPaymentsBilled OnTimeProsperPayments
##  Min.   :1.000     Min.   :  0.00             Min.   :  0.00       
##  1st Qu.:1.000     1st Qu.:  8.00             1st Qu.:  8.00       
##  Median :1.000     Median : 12.00             Median : 12.00       
##  Mean   :1.254     Mean   : 18.59             Mean   : 17.77       
##  3rd Qu.:1.000     3rd Qu.: 25.00             3rd Qu.: 23.00       
##  Max.   :7.000     Max.   :103.00             Max.   :101.00       
##  NA's   :14324     NA's   :14324              NA's   :14324        
##  ProsperPaymentsLessThanOneMonthLate ProsperPaymentsOneMonthPlusLate
##  Min.   : 0.000                      Min.   :0.000                  
##  1st Qu.: 0.000                      1st Qu.:0.000                  
##  Median : 0.000                      Median :0.000                  
##  Mean   : 0.753                      Mean   :0.061                  
##  3rd Qu.: 0.000                      3rd Qu.:0.000                  
##  Max.   :26.000                      Max.   :8.000                  
##  NA's   :14324                       NA's   :14324                  
##  ProsperPrincipalBorrowed ProsperPrincipalOutstanding
##  Min.   : 1000            Min.   :    0.00           
##  1st Qu.: 3000            1st Qu.:    0.17           
##  Median : 5000            Median : 2045.10           
##  Mean   : 6682            Mean   : 3055.53           
##  3rd Qu.: 8500            3rd Qu.: 4179.37           
##  Max.   :53200            Max.   :22586.67           
##  NA's   :14324            NA's   :14324              
##  ScorexChangeAtTimeOfListing LoanCurrentDaysDelinquent
##  Min.   :-194.00             Min.   :  16             
##  1st Qu.: -40.00             1st Qu.: 292             
##  Median :  -6.00             Median : 805             
##  Mean   : -10.05             Mean   :1032             
##  3rd Qu.:  19.00             3rd Qu.:1781             
##  Max.   : 214.00             Max.   :2704             
##  NA's   :14325                                        
##  LoanFirstDefaultedCycleNumber LoanMonthsSinceOrigination   LoanNumber    
##  Min.   : 0.00                 Min.   : 5.0               Min.   :    29  
##  1st Qu.: 9.00                 1st Qu.:33.0               1st Qu.: 14850  
##  Median :14.00                 Median :70.0               Median : 30773  
##  Mean   :16.22                 Mean   :60.5               Mean   : 33277  
##  3rd Qu.:22.00                 3rd Qu.:82.0               3rd Qu.: 50746  
##  Max.   :44.00                 Max.   :98.0               Max.   :103467  
##  NA's   :13                                                               
##  LoanOriginalAmount LoanOriginationDate  LoanOriginationQuarter
##  Min.   : 1000      Min.   :2006-01-27   Q2 2008:1444          
##  1st Qu.: 2999      1st Qu.:2007-05-11   Q1 2007:1282          
##  Median : 4500      Median :2008-05-08   Q2 2007:1250          
##  Mean   : 6449      Mean   :2009-03-02   Q3 2008:1127          
##  3rd Qu.: 8000      3rd Qu.:2011-06-30   Q3 2007:1032          
##  Max.   :25000      Max.   :2013-10-01   Q4 2006:1023          
##                                          (Other):9557          
##                    MemberKey     MonthlyLoanPayment LP_CustomerPayments
##  006C3373804016872128132:    2   Min.   :   0.0     Min.   :   -2.35   
##  009C35078002646985845CF:    2   1st Qu.: 108.6     1st Qu.:  676.90   
##  00C43387968070538859D91:    2   Median : 173.7     Median : 1653.67   
##  018B35275926204010E51B6:    2   Mean   : 236.6     Mean   : 2794.26   
##  01D33386346150055C7F757:    2   3rd Qu.: 309.5     3rd Qu.: 3561.80   
##  01DA3382241797159B9FE89:    2   Max.   :1552.8     Max.   :34021.80   
##  (Other)                :16703                                         
##  LP_CustomerPrincipalPayments LP_InterestandFees LP_ServiceFees   
##  Min.   :    0.0              Min.   :   -2.35   Min.   :-664.87  
##  1st Qu.:  297.7              1st Qu.:  321.14   1st Qu.: -59.02  
##  Median :  850.3              Median :  727.10   Median : -26.48  
##  Mean   : 1696.4              Mean   : 1097.89   Mean   : -44.63  
##  3rd Qu.: 2051.3              3rd Qu.: 1445.17   3rd Qu.: -10.54  
##  Max.   :24596.4              Max.   :14329.49   Max.   :  32.06  
##                                                                   
##  LP_CollectionFees  LP_GrossPrincipalLoss LP_NetPrincipalLoss
##  Min.   :-9274.75   Min.   :  -94.2       Min.   : -954.5    
##  1st Qu.:  -17.00   1st Qu.: 1841.3       1st Qu.: 1750.0    
##  Median :    0.00   Median : 3336.2       Median : 3247.4    
##  Mean   :  -67.23   Mean   : 4747.8       Mean   : 4644.9    
##  3rd Qu.:    0.00   3rd Qu.: 6028.8       3rd Qu.: 5909.2    
##  Max.   :    0.00   Max.   :25000.0       Max.   :25000.0    
##                                                              
##  LP_NonPrincipalRecoverypayments PercentFunded    Recommendations   
##  Min.   :    0.0                 Min.   :0.7012   Min.   : 0.00000  
##  1st Qu.:    0.0                 1st Qu.:1.0000   1st Qu.: 0.00000  
##  Median :    0.0                 Median :1.0000   Median : 0.00000  
##  Mean   :  164.1                 Mean   :0.9982   Mean   : 0.08268  
##  3rd Qu.:    0.0                 3rd Qu.:1.0000   3rd Qu.: 0.00000  
##  Max.   :21117.9                 Max.   :1.0000   Max.   :16.00000  
##                                                                     
##  InvestmentFromFriendsCount InvestmentFromFriendsAmount   Investors     
##  Min.   :0.00000            Min.   :    0.00            Min.   :  1.00  
##  1st Qu.:0.00000            1st Qu.:    0.00            1st Qu.: 27.00  
##  Median :0.00000            Median :    0.00            Median : 60.00  
##  Mean   :0.03362            Mean   :   27.64            Mean   : 97.67  
##  3rd Qu.:0.00000            3rd Qu.:    0.00            3rd Qu.:129.00  
##  Max.   :9.00000            Max.   :12500.00            Max.   :881.00  
## 
summary(filter(data, LP_NonPrincipalRecoverypayments!=0))
##                    ListingKey   ListingNumber    ListingCreationDate 
##  0005353671687550573289D:   1   Min.   :    99   Min.   :2006-01-25  
##  001035373445372274F74E2:   1   1st Qu.:141414   1st Qu.:2007-05-22  
##  00293413955892317967503:   1   Median :295332   Median :2008-03-18  
##  00433419411531491904742:   1   Mean   :299412   Mean   :2008-10-07  
##  005E35068034002701D1E8F:   1   3rd Qu.:448293   3rd Qu.:2010-03-01  
##  007E35498620125415AF2FF:   1   Max.   :813315   Max.   :2013-06-18  
##  (Other)                :3255                                        
##   CreditGrade       Term                       LoanStatus  
##  C      :492   Min.   :12.0   Chargedoff            :2048  
##  D      :482   1st Qu.:36.0   Defaulted             :1213  
##  B      :354   Median :36.0   Cancelled             :   0  
##  E      :351   Mean   :36.8   Completed             :   0  
##  HR     :336   3rd Qu.:36.0   Current               :   0  
##  (Other):324   Max.   :60.0   FinalPaymentInProgress:   0  
##  NA's   :922                  (Other)               :   0  
##    ClosedDate          BorrowerAPR       BorrowerRate     LenderYield     
##  Min.   :2006-09-22   Min.   :0.01315   Min.   :0.0000   Min.   :-0.0100  
##  1st Qu.:2009-01-07   1st Qu.:0.17722   1st Qu.:0.1650   1st Qu.: 0.1530  
##  Median :2009-12-16   Median :0.24264   Median :0.2300   Median : 0.2169  
##  Mean   :2010-05-21   Mean   :0.24646   Mean   :0.2275   Mean   : 0.2167  
##  3rd Qu.:2011-12-02   3rd Qu.:0.30564   3rd Qu.:0.2900   3rd Qu.: 0.2800  
##  Max.   :2014-02-28   Max.   :0.41355   Max.   :0.3600   Max.   : 0.3525  
##                                                                           
##  EstimatedEffectiveYield EstimatedLoss    EstimatedReturn  
##  Min.   :-0.0508         Min.   :0.0060   Min.   :-0.0508  
##  1st Qu.: 0.1496         1st Qu.:0.0890   1st Qu.: 0.1124  
##  Median : 0.2377         Median :0.1120   Median : 0.1304  
##  Mean   : 0.2131         Mean   :0.1189   Mean   : 0.1260  
##  3rd Qu.: 0.2861         3rd Qu.:0.1490   3rd Qu.: 0.1463  
##  Max.   : 0.3199         Max.   :0.3660   Max.   : 0.2230  
##  NA's   :2339            NA's   :2339     NA's   :2339     
##  ProsperRating.num ProsperRating.alpha  ProsperScore   
##  Min.   :1.000     D      : 265        Min.   : 1.000  
##  1st Qu.:2.000     E      : 210        1st Qu.: 4.000  
##  Median :3.000     HR     : 206        Median : 5.000  
##  Mean   :2.807     C      : 110        Mean   : 5.397  
##  3rd Qu.:4.000     B      :  67        3rd Qu.: 7.000  
##  Max.   :7.000     (Other):  64        Max.   :10.000  
##  NA's   :2339      NA's   :2339        NA's   :2339    
##  ListingCategory.num BorrowerState               Occupation  
##  0      :1342        CA     : 397   Other             : 824  
##  1      : 899        GA     : 211   Professional      : 372  
##  7      : 284        IL     : 204   Teacher           : 156  
##  3      : 225        FL     : 134   Clerical          : 141  
##  4      : 198        NY     : 134   Sales - Commission: 134  
##  2      : 136        (Other):1834   (Other)           :1521  
##  (Other): 177        NA's   : 347   NA's              : 113  
##       EmploymentStatus EmploymentStatusDuration IsBorrowerHomeowner
##  Full-time    :1839    Min.   :  0.0            Mode :logical      
##  Employed     : 546    1st Qu.: 20.0            FALSE:1711         
##  Not available: 368    Median : 53.0            TRUE :1550         
##  Self-employed: 195    Mean   : 83.1                               
##  Part-time    :  77    3rd Qu.:117.0                               
##  (Other)      : 123    Max.   :573.0                               
##  NA's         : 113    NA's   :482                                 
##  CurrentlyInGroup                    GroupKey    DateCreditPulled    
##  Mode :logical    783C3371218786870A73D20:  87   Min.   :2005-12-11  
##  FALSE:2296       FEF83377364176536637E50:  78   1st Qu.:2007-05-16  
##  TRUE :965        3D4D3366260257624AB272D:  43   Median :2008-03-14  
##                   9BBE337094173775621CD34:  43   Mean   :2008-10-03  
##                   FE113364863511529673D04:  34   3rd Qu.:2010-02-22  
##                   (Other)                : 703   Max.   :2013-06-18  
##                   NA's                   :2273                       
##  CreditScoreRangeLower CreditScoreRangeUpper FirstRecordedCreditLine
##  Min.   :  0.0         Min.   : 19.0         Min.   :1958-02-01     
##  1st Qu.:600.0         1st Qu.:619.0         1st Qu.:1989-04-01     
##  Median :640.0         Median :659.0         Median :1994-09-21     
##  Mean   :642.7         Mean   :661.7         Mean   :1993-10-05     
##  3rd Qu.:700.0         3rd Qu.:719.0         3rd Qu.:1999-01-07     
##  Max.   :860.0         Max.   :879.0         Max.   :2010-12-02     
##  NA's   :24            NA's   :24            NA's   :28             
##  CurrentCreditLines OpenCreditLines  TotalCreditLinespast7years
##  Min.   : 0.000     Min.   : 0.000   Min.   :  2.00            
##  1st Qu.: 5.000     1st Qu.: 4.000   1st Qu.: 15.00            
##  Median : 8.000     Median : 7.000   Median : 25.00            
##  Mean   : 9.275     Mean   : 7.878   Mean   : 26.67            
##  3rd Qu.:13.000     3rd Qu.:11.000   3rd Qu.: 36.00            
##  Max.   :45.000     Max.   :39.000   Max.   :120.00            
##  NA's   :481        NA's   :481      NA's   :28                
##  OpenRevolvingAccounts OpenRevolvingMonthlyPayment InquiriesLast6Months
##  Min.   : 0.000        Min.   :   0.0              Min.   : 0.000      
##  1st Qu.: 2.000        1st Qu.:  40.0              1st Qu.: 1.000      
##  Median : 5.000        Median : 144.0              Median : 2.000      
##  Mean   : 5.505        Mean   : 319.7              Mean   : 2.962      
##  3rd Qu.: 8.000        3rd Qu.: 391.0              3rd Qu.: 4.000      
##  Max.   :40.000        Max.   :5853.0              Max.   :97.000      
##                                                    NA's   :28          
##  TotalInquiries    CurrentDelinquencies AmountDelinquent
##  Min.   :  0.000   Min.   : 0.000       Min.   :     0  
##  1st Qu.:  3.000   1st Qu.: 0.000       1st Qu.:     0  
##  Median :  7.000   Median : 0.000       Median :     0  
##  Mean   :  9.803   Mean   : 1.393       Mean   :  1347  
##  3rd Qu.: 13.000   3rd Qu.: 1.000       3rd Qu.:   179  
##  Max.   :379.000   Max.   :64.000       Max.   :215315  
##  NA's   :52        NA's   :28           NA's   :481     
##  DelinquenciesLast7Years PublicRecordsLast10Years
##  Min.   : 0.000          Min.   : 0.0000         
##  1st Qu.: 0.000          1st Qu.: 0.0000         
##  Median : 1.000          Median : 0.0000         
##  Mean   : 7.099          Mean   : 0.4476         
##  3rd Qu.: 9.000          3rd Qu.: 1.0000         
##  Max.   :99.000          Max.   :17.0000         
##  NA's   :47              NA's   :28              
##  PublicRecordsLast12Months RevolvingCreditBalance BankcardUtilization
##  Min.   :0.0000            Min.   :     0.0       Min.   :0.0000     
##  1st Qu.:0.0000            1st Qu.:   858.8       1st Qu.:0.2375     
##  Median :0.0000            Median :  4347.5       Median :0.6500     
##  Mean   :0.0432            Mean   : 14841.3       Mean   :0.5777     
##  3rd Qu.:0.0000            3rd Qu.: 14215.0       3rd Qu.:0.9100     
##  Max.   :7.0000            Max.   :493300.0       Max.   :4.7300     
##  NA's   :481               NA's   :481            NA's   :481        
##  AvailableBankcardCredit  TotalTrades     TradesNeverDelinquent.per
##  Min.   :     0.0        Min.   :  1.00   Min.   :0.0000           
##  1st Qu.:   126.2        1st Qu.: 12.00   1st Qu.:0.6600           
##  Median :  1132.5        Median : 20.00   Median :0.8200           
##  Mean   :  6107.1        Mean   : 22.34   Mean   :0.7819           
##  3rd Qu.:  6000.0        3rd Qu.: 30.75   3rd Qu.:0.9500           
##  Max.   :364284.0        Max.   :118.00   Max.   :1.0000           
##  NA's   :475             NA's   :475      NA's   :475              
##  TradesOpenedLast6Months DebtToIncomeRatio         IncomeRange  
##  Min.   : 0.000          Min.   : 0.0100   $25,000-49,999:1036  
##  1st Qu.: 0.000          1st Qu.: 0.1400   $50,000-74,999: 730  
##  Median : 1.000          Median : 0.2200   Not displayed : 494  
##  Mean   : 1.107          Mean   : 0.3263   $75,000-99,999: 344  
##  3rd Qu.: 2.000          3rd Qu.: 0.3400   $1-24,999     : 324  
##  Max.   :13.000          Max.   :10.0100   $100,000+     : 259  
##  NA's   :475             NA's   :224       (Other)       :  74  
##  IncomeVerifiable StatedMonthlyIncome                    LoanKey    
##  Mode :logical    Min.   :     0      000B3366346245964D6187E:   1  
##  FALSE:228        1st Qu.:  2625      002E362825862155611E637:   1  
##  TRUE :3033       Median :  4000      0038364851207507437AE49:   1  
##                   Mean   :  4620      005D36460825304664D7BC1:   1  
##                   3rd Qu.:  5792      00803393723192256BBDBA4:   1  
##                   Max.   :208333      008A366194593193999BEE4:   1  
##                                       (Other)                :3255  
##  TotalProsperLoans TotalProsperPaymentsBilled OnTimeProsperPayments
##  Min.   :1.000     Min.   : 0.00              Min.   : 0.00        
##  1st Qu.:1.000     1st Qu.: 8.00              1st Qu.: 7.00        
##  Median :1.000     Median :12.00              Median :12.00        
##  Mean   :1.177     Mean   :17.82              Mean   :16.67        
##  3rd Qu.:1.000     3rd Qu.:23.00              3rd Qu.:21.00        
##  Max.   :5.000     Max.   :90.00              Max.   :90.00        
##  NA's   :2799      NA's   :2799               NA's   :2799         
##  ProsperPaymentsLessThanOneMonthLate ProsperPaymentsOneMonthPlusLate
##  Min.   : 0.000                      Min.   :0.0000                 
##  1st Qu.: 0.000                      1st Qu.:0.0000                 
##  Median : 0.000                      Median :0.0000                 
##  Mean   : 1.069                      Mean   :0.0823                 
##  3rd Qu.: 1.000                      3rd Qu.:0.0000                 
##  Max.   :24.000                      Max.   :7.0000                 
##  NA's   :2799                        NA's   :2799                   
##  ProsperPrincipalBorrowed ProsperPrincipalOutstanding
##  Min.   : 1000            Min.   :    0.0            
##  1st Qu.: 2550            1st Qu.:  321.6            
##  Median : 4022            Median : 1982.3            
##  Mean   : 5702            Mean   : 2746.8            
##  3rd Qu.: 7150            3rd Qu.: 3916.9            
##  Max.   :30000            Max.   :20946.7            
##  NA's   :2799             NA's   :2799               
##  ScorexChangeAtTimeOfListing LoanCurrentDaysDelinquent
##  Min.   :-160.0000           Min.   :  16             
##  1st Qu.: -36.0000           1st Qu.: 368             
##  Median :   0.0000           Median : 951             
##  Mean   :   0.8351           Mean   :1042             
##  3rd Qu.:  37.0000           3rd Qu.:1688             
##  Max.   : 214.0000           Max.   :2613             
##  NA's   :2800                                         
##  LoanFirstDefaultedCycleNumber LoanMonthsSinceOrigination   LoanNumber   
##  Min.   : 0.00                 Min.   : 9.00              Min.   :   29  
##  1st Qu.: 8.00                 1st Qu.:48.00              1st Qu.:15589  
##  Median :14.00                 Median :72.00              Median :29091  
##  Mean   :15.79                 Mean   :64.91              Mean   :29839  
##  3rd Qu.:23.00                 3rd Qu.:81.00              3rd Qu.:41253  
##  Max.   :41.00                 Max.   :98.00              Max.   :93727  
##                                                                          
##  LoanOriginalAmount LoanOriginationDate  LoanOriginationQuarter
##  Min.   : 1000      Min.   :2006-01-27   Q2 2008: 375          
##  1st Qu.: 3000      1st Qu.:2007-06-01   Q2 2007: 310          
##  Median : 5000      Median :2008-03-31   Q3 2008: 303          
##  Mean   : 6575      Mean   :2008-10-19   Q1 2008: 255          
##  3rd Qu.: 8000      3rd Qu.:2010-03-17   Q3 2007: 240          
##  Max.   :25000      Max.   :2013-06-20   Q1 2007: 233          
##                                          (Other):1545          
##                    MemberKey    MonthlyLoanPayment LP_CustomerPayments
##  00C43387968070538859D91:   2   Min.   :   0.0     Min.   :    0      
##  030E3403407292850D85CBC:   2   1st Qu.: 105.1     1st Qu.:  962      
##  03863429108114327FA5713:   2   Median : 173.7     Median : 2169      
##  03F43394048903402B10A91:   2   Mean   : 235.3     Mean   : 3456      
##  04583480851747351D652F4:   2   3rd Qu.: 309.3     3rd Qu.: 4365      
##  049233823972095378B0176:   2   Max.   :1130.9     Max.   :34022      
##  (Other)                :3249                                         
##  LP_CustomerPrincipalPayments LP_InterestandFees LP_ServiceFees   
##  Min.   :    0.0              Min.   :    0.0    Min.   :-664.87  
##  1st Qu.:  417.9              1st Qu.:  453.3    1st Qu.: -73.69  
##  Median : 1121.9              Median :  937.6    Median : -35.93  
##  Mean   : 2125.9              Mean   : 1330.4    Mean   : -56.06  
##  3rd Qu.: 2535.7              3rd Qu.: 1713.5    3rd Qu.: -16.02  
##  Max.   :24939.2              Max.   :14329.5    Max.   :  32.06  
##                                                                   
##  LP_CollectionFees  LP_GrossPrincipalLoss LP_NetPrincipalLoss
##  Min.   :-9274.75   Min.   :  -94.2       Min.   : -954.5    
##  1st Qu.: -372.00   1st Qu.: 1663.6       1st Qu.:  939.7    
##  Median : -104.77   Median : 3109.4       Median : 2560.9    
##  Mean   : -300.52   Mean   : 4434.9       Mean   : 3759.6    
##  3rd Qu.:  -17.52   3rd Qu.: 5669.3       3rd Qu.: 4850.2    
##  Max.   :    0.00   Max.   :25000.0       Max.   :25000.0    
##                                                              
##  LP_NonPrincipalRecoverypayments PercentFunded    Recommendations   
##  Min.   :    0.04                Min.   :0.7013   Min.   : 0.00000  
##  1st Qu.:  132.00                1st Qu.:1.0000   1st Qu.: 0.00000  
##  Median :  387.82                Median :1.0000   Median : 0.00000  
##  Mean   :  878.47                Mean   :0.9985   Mean   : 0.09874  
##  3rd Qu.: 1025.00                3rd Qu.:1.0000   3rd Qu.: 0.00000  
##  Max.   :21117.90                Max.   :1.0000   Max.   :16.00000  
##                                                                     
##  InvestmentFromFriendsCount InvestmentFromFriendsAmount   Investors    
##  Min.   :0.00000            Min.   :    0.00            Min.   :  1.0  
##  1st Qu.:0.00000            1st Qu.:    0.00            1st Qu.: 32.0  
##  Median :0.00000            Median :    0.00            Median : 66.0  
##  Mean   :0.04293            Mean   :   35.84            Mean   :106.6  
##  3rd Qu.:0.00000            3rd Qu.:    0.00            3rd Qu.:142.0  
##  Max.   :7.00000            Max.   :12500.00            Max.   :821.0  
## 
summary(filter(data, !is.na(ClosedDate) & LP_NonPrincipalRecoverypayments==0))
##                    ListingKey    ListingNumber     ListingCreationDate 
##  018A360063948152589C8BE:    2   Min.   :      4   Min.   :2005-11-09  
##  30F435938764424435A1188:    2   1st Qu.: 190746   1st Qu.:2007-08-21  
##  32943590099161153292459:    2   Median : 396320   Median :2008-09-10  
##  6DFC3591891372387BB41B2:    2   Mean   : 373438   Mean   :2009-08-01  
##  778D35919242972923313E0:    2   3rd Qu.: 528024   3rd Qu.:2011-09-19  
##  82FD35914405776692938D4:    2   Max.   :1204824   Max.   :2014-02-13  
##  (Other)                :51816                                         
##   CreditGrade         Term                        LoanStatus   
##  C      : 5157   Min.   :12.00   Completed             :38074  
##  D      : 4671   1st Qu.:36.00   Chargedoff            : 9944  
##  B      : 4035   Median :36.00   Defaulted             : 3805  
##  AA     : 3385   Mean   :36.95   Cancelled             :    5  
##  HR     : 3172   3rd Qu.:36.00   Current               :    0  
##  (Other): 6194   Max.   :60.00   FinalPaymentInProgress:    0  
##  NA's   :25214                   (Other)               :    0  
##    ClosedDate          BorrowerAPR       BorrowerRate     LenderYield     
##  Min.   :2005-11-25   Min.   :0.00653   Min.   :0.0000   Min.   :-0.0100  
##  1st Qu.:2009-08-06   1st Qu.:0.14709   1st Qu.:0.1315   1st Qu.: 0.1225  
##  Median :2011-05-03   Median :0.21223   Median :0.1900   Median : 0.1800  
##  Mean   :2011-03-26   Mean   :0.22067   Mean   :0.1987   Mean   : 0.1886  
##  3rd Qu.:2013-02-14   3rd Qu.:0.29510   3rd Qu.:0.2669   3rd Qu.: 0.2545  
##  Max.   :2014-03-10   Max.   :0.51229   Max.   :0.4975   Max.   : 0.4925  
##                       NA's   :25                                          
##  EstimatedEffectiveYield EstimatedLoss   EstimatedReturn 
##  Min.   :-0.183          Min.   :0.005   Min.   :-0.183  
##  1st Qu.: 0.108          1st Qu.:0.050   1st Qu.: 0.077  
##  Median : 0.170          Median :0.096   Median : 0.112  
##  Mean   : 0.175          Mean   :0.093   Mean   : 0.107  
##  3rd Qu.: 0.245          3rd Qu.:0.140   3rd Qu.: 0.136  
##  Max.   : 0.320          Max.   :0.366   Max.   : 0.284  
##  NA's   :26745           NA's   :26745   NA's   :26745   
##  ProsperRating.num ProsperRating.alpha  ProsperScore   
##  Min.   :1.000     D      : 5604       Min.   : 1.000  
##  1st Qu.:2.000     C      : 3707       1st Qu.: 5.000  
##  Median :3.000     E      : 3620       Median : 6.000  
##  Mean   :3.694     A      : 3552       Mean   : 6.298  
##  3rd Qu.:5.000     HR     : 3519       3rd Qu.: 8.000  
##  Max.   :7.000     (Other): 5081       Max.   :11.000  
##  NA's   :26745     NA's   :26745       NA's   :26745   
##  ListingCategory.num BorrowerState                      Occupation   
##  1      :16969       CA     : 6866   Other                   :13232  
##  0      :15610       FL     : 2944   Professional            : 6143  
##  7      : 5758       IL     : 2835   Computer Programmer     : 2399  
##  3      : 4032       TX     : 2638   Administrative Assistant: 1807  
##  2      : 3108       GA     : 2572   Analyst                 : 1715  
##  4      : 2197       (Other):28805   (Other)                 :24377  
##  (Other): 4154       NA's   : 5168   NA's                    : 2155  
##       EmploymentStatus EmploymentStatusDuration IsBorrowerHomeowner
##  Full-time    :23119   Min.   :  0.00           Mode :logical      
##  Employed     :15945   1st Qu.: 21.00           FALSE:27491        
##  Not available: 4979   Median : 52.00           TRUE :24337        
##  Self-employed: 2731   Mean   : 80.76                              
##  Part-time    :  979   3rd Qu.:112.00                              
##  (Other)      : 1933   Max.   :755.00                              
##  NA's         : 2142   NA's   :7133                                
##  CurrentlyInGroup                    GroupKey     DateCreditPulled    
##  Mode :logical    783C3371218786870A73D20:  974   Min.   :2005-11-09  
##  FALSE:40949      3D4D3366260257624AB272D:  763   1st Qu.:2007-08-15  
##  TRUE :10879      6A3B336601725506917317E:  641   Median :2008-09-09  
##                   FEF83377364176536637E50:  504   Mean   :2009-08-01  
##                   C9643379247860156A00EC0:  319   3rd Qu.:2011-09-19  
##                   (Other)                : 8480   Max.   :2014-02-13  
##                   NA's                   :40147                       
##  CreditScoreRangeLower CreditScoreRangeUpper FirstRecordedCreditLine
##  Min.   :  0.0         Min.   : 19.0         Min.   :1947-08-24     
##  1st Qu.:640.0         1st Qu.:659.0         1st Qu.:1990-10-31     
##  Median :680.0         Median :699.0         Median :1995-11-01     
##  Mean   :673.6         Mean   :692.6         Mean   :1994-12-27     
##  3rd Qu.:720.0         3rd Qu.:739.0         3rd Qu.:2000-01-25     
##  Max.   :880.0         Max.   :899.0         Max.   :2012-06-19     
##  NA's   :567           NA's   :567           NA's   :669            
##  CurrentCreditLines OpenCreditLines  TotalCreditLinespast7years
##  Min.   : 0.000     Min.   : 0.000   Min.   :  2.00            
##  1st Qu.: 6.000     1st Qu.: 5.000   1st Qu.: 15.00            
##  Median : 9.000     Median : 8.000   Median : 23.00            
##  Mean   : 9.587     Mean   : 8.366   Mean   : 25.19            
##  3rd Qu.:13.000     3rd Qu.:11.000   3rd Qu.: 33.00            
##  Max.   :59.000     Max.   :51.000   Max.   :136.00            
##  NA's   :7123       NA's   :7123     NA's   :669               
##  OpenRevolvingAccounts OpenRevolvingMonthlyPayment InquiriesLast6Months
##  Min.   : 0.000        Min.   :    0.0             Min.   :  0.000     
##  1st Qu.: 3.000        1st Qu.:   61.0             1st Qu.:  0.000     
##  Median : 5.000        Median :  185.0             Median :  1.000     
##  Mean   : 6.114        Mean   :  325.6             Mean   :  1.994     
##  3rd Qu.: 8.000        3rd Qu.:  419.0             3rd Qu.:  3.000     
##  Max.   :51.000        Max.   :14985.0             Max.   :105.000     
##                                                    NA's   :669         
##  TotalInquiries CurrentDelinquencies AmountDelinquent
##  Min.   :  0    Min.   : 0.0000      Min.   :     0  
##  1st Qu.:  2    1st Qu.: 0.0000      1st Qu.:     0  
##  Median :  5    Median : 0.0000      Median :     0  
##  Mean   :  7    Mean   : 0.8757      Mean   :  1032  
##  3rd Qu.:  9    3rd Qu.: 1.0000      3rd Qu.:     0  
##  Max.   :377    Max.   :83.0000      Max.   :444745  
##  NA's   :1107   NA's   :669          NA's   :7141    
##  DelinquenciesLast7Years PublicRecordsLast10Years
##  Min.   : 0.000          Min.   : 0.0000         
##  1st Qu.: 0.000          1st Qu.: 0.0000         
##  Median : 0.000          Median : 0.0000         
##  Mean   : 4.412          Mean   : 0.3237         
##  3rd Qu.: 4.000          3rd Qu.: 0.0000         
##  Max.   :99.000          Max.   :30.0000         
##  NA's   :943             NA's   :669             
##  PublicRecordsLast12Months RevolvingCreditBalance BankcardUtilization
##  Min.   :0.000             Min.   :      0        Min.   :0.000      
##  1st Qu.:0.000             1st Qu.:   1677        1st Qu.:0.210      
##  Median :0.000             Median :   6170        Median :0.560      
##  Mean   :0.023             Mean   :  15742        Mean   :0.527      
##  3rd Qu.:0.000             3rd Qu.:  16306        3rd Qu.:0.840      
##  Max.   :7.000             Max.   :1435667        Max.   :5.950      
##  NA's   :7123              NA's   :7123           NA's   :7123       
##  AvailableBankcardCredit  TotalTrades     TradesNeverDelinquent.per
##  Min.   :     0          Min.   :  0.00   Min.   :0.000            
##  1st Qu.:   546          1st Qu.: 13.00   1st Qu.:0.780            
##  Median :  3439          Median : 20.00   Median :0.920            
##  Mean   : 11139          Mean   : 21.75   Mean   :0.861            
##  3rd Qu.: 12452          3rd Qu.: 29.00   3rd Qu.:1.000            
##  Max.   :646285          Max.   :126.00   Max.   :1.000            
##  NA's   :7069            NA's   :7069     NA's   :7069             
##  TradesOpenedLast6Months DebtToIncomeRatio         IncomeRange   
##  Min.   : 0.000          Min.   : 0.000    $25,000-49,999:15308  
##  1st Qu.: 0.000          1st Qu.: 0.130    $50,000-74,999:12059  
##  Median : 1.000          Median : 0.200    Not displayed : 7247  
##  Mean   : 0.897          Mean   : 0.288    $75,000-99,999: 6098  
##  3rd Qu.: 1.000          3rd Qu.: 0.300    $100,000+     : 5805  
##  Max.   :20.000          Max.   :10.010    $1-24,999     : 4247  
##  NA's   :7069            NA's   :4006      (Other)       : 1064  
##  IncomeVerifiable StatedMonthlyIncome                    LoanKey     
##  Mode :logical    Min.   :     0      08C43696561586194AC381C:    2  
##  FALSE:4073       1st Qu.:  2833      09303699897852595CD59DD:    2  
##  TRUE :47755      Median :  4167      114D37056655628721BD6C8:    2  
##                   Mean   :  5082      156836977849742636AE34F:    2  
##                   3rd Qu.:  6250      56D73700259224545E36FBC:    2  
##                   Max.   :618548      63113695530739927C7EA06:    2  
##                                       (Other)                :51816  
##  TotalProsperLoans TotalProsperPaymentsBilled OnTimeProsperPayments
##  Min.   :0.00      Min.   :  0.00             Min.   :  0.00       
##  1st Qu.:1.00      1st Qu.:  9.00             1st Qu.:  9.00       
##  Median :1.00      Median : 15.00             Median : 15.00       
##  Mean   :1.34      Mean   : 20.18             Mean   : 19.62       
##  3rd Qu.:2.00      3rd Qu.: 29.00             3rd Qu.: 28.00       
##  Max.   :7.00      Max.   :120.00             Max.   :114.00       
##  NA's   :41751     NA's   :41751              NA's   :41751        
##  ProsperPaymentsLessThanOneMonthLate ProsperPaymentsOneMonthPlusLate
##  Min.   : 0.00                       Min.   : 0.00                  
##  1st Qu.: 0.00                       1st Qu.: 0.00                  
##  Median : 0.00                       Median : 0.00                  
##  Mean   : 0.52                       Mean   : 0.05                  
##  3rd Qu.: 0.00                       3rd Qu.: 0.00                  
##  Max.   :42.00                       Max.   :21.00                  
##  NA's   :41751                       NA's   :41751                  
##  ProsperPrincipalBorrowed ProsperPrincipalOutstanding
##  Min.   :    0            Min.   :    0              
##  1st Qu.: 3000            1st Qu.:    0              
##  Median : 5000            Median : 1043              
##  Mean   : 7169            Mean   : 2314              
##  3rd Qu.: 9500            3rd Qu.: 3362              
##  Max.   :60001            Max.   :22587              
##  NA's   :41751            NA's   :41751              
##  ScorexChangeAtTimeOfListing LoanCurrentDaysDelinquent
##  Min.   :-194.00             Min.   :   0.0           
##  1st Qu.: -32.00             1st Qu.:   0.0           
##  Median :   0.00             Median :   0.0           
##  Mean   :   1.16             Mean   : 268.7           
##  3rd Qu.:  32.00             3rd Qu.: 154.0           
##  Max.   : 286.00             Max.   :2704.0           
##  NA's   :41847                                        
##  LoanFirstDefaultedCycleNumber LoanMonthsSinceOrigination   LoanNumber    
##  Min.   : 1.00                 Min.   :  1.00             Min.   :     1  
##  1st Qu.:10.00                 1st Qu.: 30.00             1st Qu.: 18724  
##  Median :14.00                 Median : 66.00             Median : 37028  
##  Mean   :16.38                 Mean   : 55.05             Mean   : 38445  
##  3rd Qu.:22.00                 3rd Qu.: 78.00             3rd Qu.: 53926  
##  Max.   :44.00                 Max.   :100.00             Max.   :132453  
##  NA's   :38145                                                            
##  LoanOriginalAmount LoanOriginationDate  LoanOriginationQuarter
##  Min.   : 1000      Min.   :2005-11-15   Q2 2008: 3969         
##  1st Qu.: 2600      1st Qu.:2007-09-04   Q3 2008: 3299         
##  Median : 4500      Median :2008-09-25   Q1 2007: 2846         
##  Mean   : 6242      Mean   :2009-08-15   Q1 2008: 2819         
##  3rd Qu.: 8000      3rd Qu.:2011-09-29   Q2 2007: 2808         
##  Max.   :35000      Max.   :2014-02-21   (Other):36065         
##                                          NA's   :   22         
##                    MemberKey     MonthlyLoanPayment LP_CustomerPayments
##  16083364744933457E57FB9:    8   Min.   :   0.00    Min.   :   -2.35   
##  63CA34120866140639431C9:    8   1st Qu.:  98.03    1st Qu.: 2111.43   
##  739C338135235294782AE75:    8   Median : 172.57    Median : 4354.66   
##  7E1733653050264822FAA3D:    8   Mean   : 222.99    Mean   : 6082.62   
##  C70934206057523078260C7:    8   3rd Qu.: 299.71    3rd Qu.: 8166.37   
##  458E33818543661332BC1BE:    7   Max.   :2251.51    Max.   :40702.39   
##  (Other)                :51781                                         
##  LP_CustomerPrincipalPayments LP_InterestandFees LP_ServiceFees   
##  Min.   :    0                Min.   :   -2.35   Min.   :-589.95  
##  1st Qu.: 1500                1st Qu.:  324.20   1st Qu.: -72.99  
##  Median : 3500                Median :  753.92   Median : -34.16  
##  Mean   : 4968                Mean   : 1114.18   Mean   : -53.51  
##  3rd Qu.: 7000                3rd Qu.: 1496.82   3rd Qu.: -14.57  
##  Max.   :35000                Max.   :15617.03   Max.   :   2.87  
##                                                                   
##  LP_CollectionFees   LP_GrossPrincipalLoss LP_NetPrincipalLoss
##  Min.   :-1996.860   Min.   :    0.0       Min.   : -474.3    
##  1st Qu.:    0.000   1st Qu.:    0.0       1st Qu.:    0.0    
##  Median :    0.000   Median :    0.0       Median :    0.0    
##  Mean   :   -7.666   Mean   : 1260.8       Mean   : 1261.5    
##  3rd Qu.:    0.000   3rd Qu.:  662.6       3rd Qu.:  664.3    
##  Max.   :    0.000   Max.   :25000.0       Max.   :25000.0    
##                                                               
##  LP_NonPrincipalRecoverypayments PercentFunded    Recommendations   
##  Min.   :0                       Min.   :0.7000   Min.   : 0.00000  
##  1st Qu.:0                       1st Qu.:1.0000   1st Qu.: 0.00000  
##  Median :0                       Median :1.0000   Median : 0.00000  
##  Mean   :0                       Mean   :0.9986   Mean   : 0.08879  
##  3rd Qu.:0                       3rd Qu.:1.0000   3rd Qu.: 0.00000  
##  Max.   :0                       Max.   :1.0110   Max.   :39.00000  
##                                                                     
##  InvestmentFromFriendsCount InvestmentFromFriendsAmount   Investors     
##  Min.   : 0.00000           Min.   :    0.00            Min.   :   1.0  
##  1st Qu.: 0.00000           1st Qu.:    0.00            1st Qu.:  31.0  
##  Median : 0.00000           Median :    0.00            Median :  69.0  
##  Mean   : 0.04631           Mean   :   33.44            Mean   : 104.8  
##  3rd Qu.: 0.00000           3rd Qu.:    0.00            3rd Qu.: 142.0  
##  Max.   :33.00000           Max.   :25000.00            Max.   :1189.0  
## 
summary(filter(data, !is.na(ClosedDate) & LP_NonPrincipalRecoverypayments==0 & LoanStatus=="Completed"))
##                    ListingKey    ListingNumber     ListingCreationDate 
##  018A360063948152589C8BE:    2   Min.   :      4   Min.   :2005-11-09  
##  30F435938764424435A1188:    2   1st Qu.: 221154   1st Qu.:2007-10-25  
##  32943590099161153292459:    2   Median : 425476   Median :2009-09-22  
##  6DFC3591891372387BB41B2:    2   Mean   : 388420   Mean   :2009-09-15  
##  778D35919242972923313E0:    2   3rd Qu.: 529268   3rd Qu.:2011-09-24  
##  82FD35914405776692938D4:    2   Max.   :1204824   Max.   :2014-02-13  
##  (Other)                :38062                                         
##   CreditGrade         Term                        LoanStatus   
##  C      : 3609   Min.   :12.00   Completed             :38074  
##  D      : 3126   1st Qu.:36.00   Cancelled             :    0  
##  B      : 2987   Median :36.00   Chargedoff            :    0  
##  AA     : 2969   Mean   :36.61   Current               :    0  
##  A      : 2505   3rd Qu.:36.00   Defaulted             :    0  
##  (Other): 3092   Max.   :60.00   FinalPaymentInProgress:    0  
##  NA's   :19786                   (Other)               :    0  
##    ClosedDate          BorrowerAPR       BorrowerRate     LenderYield     
##  Min.   :2005-11-25   Min.   :0.00653   Min.   :0.0000   Min.   :-0.0100  
##  1st Qu.:2010-01-28   1st Qu.:0.13271   1st Qu.:0.1173   1st Qu.: 0.1080  
##  Median :2011-07-14   Median :0.19479   Median :0.1744   Median : 0.1644  
##  Mean   :2011-06-14   Mean   :0.20878   Mean   :0.1864   Mean   : 0.1766  
##  3rd Qu.:2013-03-06   3rd Qu.:0.28498   3rd Qu.:0.2511   3rd Qu.: 0.2411  
##  Max.   :2014-03-04   Max.   :0.51229   Max.   :0.4975   Max.   : 0.4925  
##                       NA's   :25                                          
##  EstimatedEffectiveYield EstimatedLoss   EstimatedReturn 
##  Min.   :-0.183          Min.   :0.005   Min.   :-0.183  
##  1st Qu.: 0.093          1st Qu.:0.041   1st Qu.: 0.072  
##  Median : 0.154          Median :0.085   Median : 0.107  
##  Mean   : 0.163          Mean   :0.087   Mean   : 0.102  
##  3rd Qu.: 0.234          3rd Qu.:0.119   3rd Qu.: 0.132  
##  Max.   : 0.320          Max.   :0.366   Max.   : 0.267  
##  NA's   :18410           NA's   :18410   NA's   :18410   
##  ProsperRating.num ProsperRating.alpha  ProsperScore   
##  Min.   :1.000     D      : 4192       Min.   : 1.000  
##  1st Qu.:3.000     A      : 3203       1st Qu.: 5.000  
##  Median :4.000     C      : 2977       Median : 7.000  
##  Mean   :3.908     B      : 2785       Mean   : 6.537  
##  3rd Qu.:5.000     E      : 2506       3rd Qu.: 8.000  
##  Max.   :7.000     (Other): 4001       Max.   :11.000  
##  NA's   :18410     NA's   :18410       NA's   :18410   
##  ListingCategory.num BorrowerState                 Occupation   
##  1      :13167       CA     : 4957   Other              : 9315  
##  0      :10253       FL     : 2094   Professional       : 4804  
##  7      : 4551       IL     : 2017   Computer Programmer: 2073  
##  3      : 2848       NY     : 1935   Analyst            : 1451  
##  2      : 2434       TX     : 1831   Executive          : 1294  
##  4      : 1620       (Other):21357   (Other)            :17685  
##  (Other): 3201       NA's   : 3883   NA's               : 1452  
##       EmploymentStatus EmploymentStatusDuration IsBorrowerHomeowner
##  Full-time    :17397   Min.   :  0.00           Mode :logical      
##  Employed     :12332   1st Qu.: 21.00           FALSE:19794        
##  Not available: 3077   Median : 53.00           TRUE :18280        
##  Self-employed: 1783   Mean   : 81.26                              
##  Part-time    :  794   3rd Qu.:112.00                              
##  (Other)      : 1252   Max.   :745.00                              
##  NA's         : 1439   NA's   :4526                                
##  CurrentlyInGroup                    GroupKey     DateCreditPulled    
##  Mode :logical    783C3371218786870A73D20:  562   Min.   :2005-11-09  
##  FALSE:30898      3D4D3366260257624AB272D:  512   1st Qu.:2007-10-22  
##  TRUE :7176       6A3B336601725506917317E:  396   Median :2009-10-01  
##                   FEF83377364176536637E50:  268   Mean   :2009-09-17  
##                   CC8D33653247904019A9059:  258   3rd Qu.:2011-09-26  
##                   (Other)                : 5762   Max.   :2014-02-13  
##                   NA's                   :30316                       
##  CreditScoreRangeLower CreditScoreRangeUpper FirstRecordedCreditLine
##  Min.   :  0.0         Min.   : 19.0         Min.   :1950-08-01     
##  1st Qu.:640.0         1st Qu.:659.0         1st Qu.:1990-10-31     
##  Median :680.0         Median :699.0         Median :1995-11-01     
##  Mean   :685.6         Mean   :704.6         Mean   :1994-12-30     
##  3rd Qu.:740.0         3rd Qu.:759.0         3rd Qu.:2000-01-21     
##  Max.   :880.0         Max.   :899.0         Max.   :2012-06-19     
##  NA's   :416           NA's   :416           NA's   :463            
##  CurrentCreditLines OpenCreditLines TotalCreditLinespast7years
##  Min.   : 0.000     Min.   : 0.00   Min.   :  2.00            
##  1st Qu.: 6.000     1st Qu.: 5.00   1st Qu.: 15.00            
##  Median : 9.000     Median : 8.00   Median : 23.00            
##  Mean   : 9.692     Mean   : 8.45   Mean   : 25.39            
##  3rd Qu.:13.000     3rd Qu.:11.00   3rd Qu.: 33.00            
##  Max.   :59.000     Max.   :48.00   Max.   :136.00            
##  NA's   :4517       NA's   :4517    NA's   :463               
##  OpenRevolvingAccounts OpenRevolvingMonthlyPayment InquiriesLast6Months
##  Min.   : 0.000        Min.   :    0               Min.   : 0.000      
##  1st Qu.: 3.000        1st Qu.:   71               1st Qu.: 0.000      
##  Median : 5.000        Median :  196               Median : 1.000      
##  Mean   : 6.294        Mean   :  328               Mean   : 1.633      
##  3rd Qu.: 8.000        3rd Qu.:  425               3rd Qu.: 2.000      
##  Max.   :49.000        Max.   :12769               Max.   :63.000      
##                                                    NA's   :463         
##  TotalInquiries    CurrentDelinquencies AmountDelinquent  
##  Min.   :  0.000   Min.   : 0.0000      Min.   :     0.0  
##  1st Qu.:  2.000   1st Qu.: 0.0000      1st Qu.:     0.0  
##  Median :  4.000   Median : 0.0000      Median :     0.0  
##  Mean   :  6.109   Mean   : 0.5958      Mean   :   895.4  
##  3rd Qu.:  8.000   3rd Qu.: 0.0000      3rd Qu.:     0.0  
##  Max.   :113.000   Max.   :50.0000      Max.   :327677.0  
##  NA's   :786       NA's   :463          NA's   :4533      
##  DelinquenciesLast7Years PublicRecordsLast10Years
##  Min.   : 0.000          Min.   : 0.0000         
##  1st Qu.: 0.000          1st Qu.: 0.0000         
##  Median : 0.000          Median : 0.0000         
##  Mean   : 3.925          Mean   : 0.2801         
##  3rd Qu.: 3.000          3rd Qu.: 0.0000         
##  Max.   :99.000          Max.   :21.0000         
##  NA's   :644             NA's   :463             
##  PublicRecordsLast12Months RevolvingCreditBalance BankcardUtilization
##  Min.   :0.000             Min.   :      0        Min.   :0.000      
##  1st Qu.:0.000             1st Qu.:   1867        1st Qu.:0.210      
##  Median :0.000             Median :   6455        Median :0.540      
##  Mean   :0.019             Mean   :  15743        Mean   :0.516      
##  3rd Qu.:0.000             3rd Qu.:  16368        3rd Qu.:0.830      
##  Max.   :4.000             Max.   :1435667        Max.   :5.950      
##  NA's   :4517              NA's   :4517           NA's   :4517       
##  AvailableBankcardCredit  TotalTrades     TradesNeverDelinquent.per
##  Min.   :     0          Min.   :  0.00   Min.   :0.000            
##  1st Qu.:   740          1st Qu.: 13.00   1st Qu.:0.800            
##  Median :  4111          Median : 20.00   Median :0.930            
##  Mean   : 12385          Mean   : 22.04   Mean   :0.872            
##  3rd Qu.: 14172          3rd Qu.: 29.00   3rd Qu.:1.000            
##  Max.   :646285          Max.   :126.00   Max.   :1.000            
##  NA's   :4474            NA's   :4474     NA's   :4474             
##  TradesOpenedLast6Months DebtToIncomeRatio         IncomeRange   
##  Min.   : 0.000          Min.   : 0.0000   $25,000-49,999:10891  
##  1st Qu.: 0.000          1st Qu.: 0.1200   $50,000-74,999: 9282  
##  Median : 0.000          Median : 0.1900   $75,000-99,999: 4914  
##  Mean   : 0.822          Mean   : 0.2642   $100,000+     : 4774  
##  3rd Qu.: 1.000          3rd Qu.: 0.2900   Not displayed : 4610  
##  Max.   :20.000          Max.   :10.0100   $1-24,999     : 2908  
##  NA's   :4474            NA's   :2734      (Other)       :  695  
##  IncomeVerifiable StatedMonthlyIncome                    LoanKey     
##  Mode :logical    Min.   :     0      08C43696561586194AC381C:    2  
##  FALSE:2782       1st Qu.:  2917      09303699897852595CD59DD:    2  
##  TRUE :35292      Median :  4417      114D37056655628721BD6C8:    2  
##                   Mean   :  5324      156836977849742636AE34F:    2  
##                   3rd Qu.:  6583      56D73700259224545E36FBC:    2  
##                   Max.   :618548      63113695530739927C7EA06:    2  
##                                       (Other)                :38062  
##  TotalProsperLoans TotalProsperPaymentsBilled OnTimeProsperPayments
##  Min.   :0.000     Min.   :  0.0              Min.   :  0.00       
##  1st Qu.:1.000     1st Qu.:  9.0              1st Qu.:  9.00       
##  Median :1.000     Median : 16.0              Median : 15.00       
##  Mean   :1.357     Mean   : 20.5              Mean   : 19.98       
##  3rd Qu.:2.000     3rd Qu.: 29.0              3rd Qu.: 28.00       
##  Max.   :7.000     Max.   :120.0              Max.   :114.00       
##  NA's   :29989     NA's   :29989              NA's   :29989        
##  ProsperPaymentsLessThanOneMonthLate ProsperPaymentsOneMonthPlusLate
##  Min.   : 0.000                      Min.   : 0.000                 
##  1st Qu.: 0.000                      1st Qu.: 0.000                 
##  Median : 0.000                      Median : 0.000                 
##  Mean   : 0.479                      Mean   : 0.041                 
##  3rd Qu.: 0.000                      3rd Qu.: 0.000                 
##  Max.   :42.000                      Max.   :21.000                 
##  NA's   :29989                       NA's   :29989                  
##  ProsperPrincipalBorrowed ProsperPrincipalOutstanding
##  Min.   :    0            Min.   :    0.0            
##  1st Qu.: 3000            1st Qu.:    0.0            
##  Median : 5000            Median :  817.7            
##  Mean   : 7238            Mean   : 2120.9            
##  3rd Qu.: 9750            3rd Qu.: 3104.2            
##  Max.   :60001            Max.   :22538.1            
##  NA's   :29989            NA's   :29989              
##  ScorexChangeAtTimeOfListing LoanCurrentDaysDelinquent
##  Min.   :-175.000            Min.   :0                
##  1st Qu.: -27.000            1st Qu.:0                
##  Median :   0.000            Median :0                
##  Mean   :   4.522            Mean   :0                
##  3rd Qu.:  35.000            3rd Qu.:0                
##  Max.   : 286.000            Max.   :0                
##  NA's   :30083                                        
##  LoanFirstDefaultedCycleNumber LoanMonthsSinceOrigination   LoanNumber    
##  Min.   : 5.00                 Min.   :  1.00             Min.   :     1  
##  1st Qu.:12.50                 1st Qu.: 29.00             1st Qu.: 21437  
##  Median :18.00                 Median : 53.00             Median : 39068  
##  Mean   :20.91                 Mean   : 53.51             Mean   : 39965  
##  3rd Qu.:29.50                 3rd Qu.: 76.00             3rd Qu.: 54246  
##  Max.   :42.00                 Max.   :100.00             Max.   :132453  
##  NA's   :38028                                                            
##  LoanOriginalAmount LoanOriginationDate  LoanOriginationQuarter
##  Min.   : 1000      Min.   :2005-11-15   Q2 2008: 2867         
##  1st Qu.: 2550      1st Qu.:2007-11-06   Q3 2008: 2455         
##  Median : 4500      Median :2009-10-23   Q1 2008: 2038         
##  Mean   : 6189      Mean   :2009-10-01   Q2 2007: 1853         
##  3rd Qu.: 8000      3rd Qu.:2011-10-07   Q1 2007: 1788         
##  Max.   :35000      Max.   :2014-02-21   (Other):27051         
##                                          NA's   :   22         
##                    MemberKey     MonthlyLoanPayment LP_CustomerPayments
##  63CA34120866140639431C9:    8   Min.   :   0.00    Min.   :    0      
##  739C338135235294782AE75:    8   1st Qu.:  94.15    1st Qu.: 3188      
##  7E1733653050264822FAA3D:    8   Median : 171.10    Median : 5445      
##  C70934206057523078260C7:    8   Mean   : 218.78    Mean   : 7323      
##  16083364744933457E57FB9:    7   3rd Qu.: 297.00    3rd Qu.: 9844      
##  A833340429888765780A3F0:    7   Max.   :2251.51    Max.   :40702      
##  (Other)                :38028                                         
##  LP_CustomerPrincipalPayments LP_InterestandFees  LP_ServiceFees   
##  Min.   :    0                Min.   :   -0.003   Min.   :-589.95  
##  1st Qu.: 2550                1st Qu.:  336.135   1st Qu.: -79.83  
##  Median : 4500                Median :  781.290   Median : -38.35  
##  Mean   : 6183                Mean   : 1139.779   Mean   : -57.65  
##  3rd Qu.: 8000                3rd Qu.: 1537.440   3rd Qu.: -16.62  
##  Max.   :35000                Max.   :15617.030   Max.   :   2.87  
##                                                                    
##  LP_CollectionFees   LP_GrossPrincipalLoss LP_NetPrincipalLoss
##  Min.   :-1996.860   Min.   :0             Min.   :0          
##  1st Qu.:    0.000   1st Qu.:0             1st Qu.:0          
##  Median :    0.000   Median :0             Median :0          
##  Mean   :   -5.104   Mean   :0             Mean   :0          
##  3rd Qu.:    0.000   3rd Qu.:0             3rd Qu.:0          
##  Max.   :    0.000   Max.   :0             Max.   :0          
##                                                               
##  LP_NonPrincipalRecoverypayments PercentFunded    Recommendations   
##  Min.   :0                       Min.   :0.7000   Min.   : 0.00000  
##  1st Qu.:0                       1st Qu.:1.0000   1st Qu.: 0.00000  
##  Median :0                       Median :1.0000   Median : 0.00000  
##  Mean   :0                       Mean   :0.9987   Mean   : 0.09245  
##  3rd Qu.:0                       3rd Qu.:1.0000   3rd Qu.: 0.00000  
##  Max.   :0                       Max.   :1.0110   Max.   :39.00000  
##                                                                     
##  InvestmentFromFriendsCount InvestmentFromFriendsAmount   Investors     
##  Min.   : 0.00000           Min.   :    0.00            Min.   :   1.0  
##  1st Qu.: 0.00000           1st Qu.:    0.00            1st Qu.:  34.0  
##  Median : 0.00000           Median :    0.00            Median :  74.0  
##  Mean   : 0.05158           Mean   :   36.28            Mean   : 108.2  
##  3rd Qu.: 0.00000           3rd Qu.:    0.00            3rd Qu.: 147.0  
##  Max.   :33.00000           Max.   :25000.00            Max.   :1189.0  
## 

What I see here is that LP_CollectionFees is non-0 primarily for people with poor credit grades or prosper ratings, and disproportionately for those whose loans have been charged off or defaulted. All of them have apparently paid collection fees.

All records where LP_NetPrincipalLoss is 0 were either charged off, or defaulted. Likewise, all records where LP_NonPrincipalRecoverypayments is 0 were charged off or defaulted.

All this leads me to provisionally conclude that the LP- records reflect actual, not predicted, measures of whether or to what extent a loan was repaid, and not predicted payments, and further that these measures accurately reflect what they are meant to measure, for all customers.

Determining Relevant Measures

I am primarily interested at looking at how well predictors of lender or borrower profit (so to say) correlate with actual profit. For this, I will exclude loans that are currently open, although it would for example be possible to look at how well various predictors correlate with likelihood of being current or past due.

The most relevant outcomes for lenders seem to be the following: LoanStatus (current of end status of the loan), LP_CustomerPayments (cumulative payments made by customers, prior to any charge-offs), LoanOriginalAmount (what the original loan was for, for determining percentage lost/repaid), and LP_NetPrincipalLoss (amount still uncollected after recoveries). LP_ServiceFees and LP_CollectionFees are also relevant to determing final yield/loss, and can be added to LP_NetPrincipalLoss to approximate total loss. Here I will create a new variable to consolidate the yield and loss measures I am interested in, so as to avoid unecessary and (for current purposes) uninteresting complexity in plotting outcomes. I will also take a closer look at data where either yield seems relatively extreme.

relevant_data <- data %>%
  filter(LoanStatus %in% c("Cancelled","Chargedoff","Completed","Defaulted")) %>%
  mutate(PercentYield = ((LoanOriginalAmount-LP_NetPrincipalLoss+LP_ServiceFees+LP_CollectionFees+LP_NonPrincipalRecoverypayments+LP_InterestandFees)/LoanOriginalAmount)-1) %>%
  mutate(Completed = LoanStatus=="Completed")

relevant_data[c(1,4,5),] %>% select(LoanOriginalAmount, starts_with("LP_"),PercentYield, Completed) %>% rowid_to_column() %>% gather(var, value, -rowid) %>% spread(rowid, value) %>% print(n = Inf)
## # A tibble: 11 x 4
##    var                                    `1`       `2`        `3`
##    <chr>                                <dbl>     <dbl>      <dbl>
##  1 Completed                            1.00      0          0    
##  2 LoanOriginalAmount                9425      4000      10000    
##  3 LP_CollectionFees                    0         0          0    
##  4 LP_CustomerPayments              11396       521       5325    
##  5 LP_CustomerPrincipalPayments      9425       210       3987    
##  6 LP_GrossPrincipalLoss                0      3790       6013    
##  7 LP_InterestandFees                1971       311       1338    
##  8 LP_NetPrincipalLoss                  0      3790       6013    
##  9 LP_NonPrincipalRecoverypayments      0         0        269    
## 10 LP_ServiceFees                  -  133     -   9.81  -   54.6  
## 11 PercentYield                         0.195 -   0.872 -    0.446
summary(relevant_data)
##                    ListingKey    ListingNumber     ListingCreationDate 
##  018A360063948152589C8BE:    2   Min.   :      4   Min.   :2005-11-09  
##  30F435938764424435A1188:    2   1st Qu.: 186264   1st Qu.:2007-08-13  
##  32943590099161153292459:    2   Median : 386511   Median :2008-08-21  
##  6DFC3591891372387BB41B2:    2   Mean   : 369056   Mean   :2009-07-14  
##  778D35919242972923313E0:    2   3rd Qu.: 524183   3rd Qu.:2011-08-27  
##  82FD35914405776692938D4:    2   Max.   :1204824   Max.   :2014-02-13  
##  (Other)                :55077                                         
##   CreditGrade         Term                        LoanStatus   
##  C      : 5649   Min.   :12.00   Completed             :38074  
##  D      : 5153   1st Qu.:36.00   Chargedoff            :11992  
##  B      : 4389   Median :36.00   Defaulted             : 5018  
##  AA     : 3509   Mean   :36.94   Cancelled             :    5  
##  HR     : 3508   3rd Qu.:36.00   Current               :    0  
##  (Other): 6745   Max.   :60.00   FinalPaymentInProgress:    0  
##  NA's   :26136                   (Other)               :    0  
##    ClosedDate          BorrowerAPR       BorrowerRate     LenderYield     
##  Min.   :2005-11-25   Min.   :0.00653   Min.   :0.0000   Min.   :-0.0100  
##  1st Qu.:2009-07-14   1st Qu.:0.14974   1st Qu.:0.1350   1st Qu.: 0.1250  
##  Median :2011-04-05   Median :0.21434   Median :0.1945   Median : 0.1826  
##  Mean   :2011-03-07   Mean   :0.22219   Mean   :0.2004   Mean   : 0.1903  
##  3rd Qu.:2013-01-30   3rd Qu.:0.29510   3rd Qu.:0.2699   3rd Qu.: 0.2572  
##  Max.   :2014-03-10   Max.   :0.51229   Max.   :0.4975   Max.   : 0.4925  
##                       NA's   :25                                          
##  EstimatedEffectiveYield EstimatedLoss   EstimatedReturn 
##  Min.   :-0.183          Min.   :0.005   Min.   :-0.183  
##  1st Qu.: 0.111          1st Qu.:0.052   1st Qu.: 0.078  
##  Median : 0.172          Median :0.098   Median : 0.114  
##  Mean   : 0.176          Mean   :0.094   Mean   : 0.108  
##  3rd Qu.: 0.247          3rd Qu.:0.140   3rd Qu.: 0.136  
##  Max.   : 0.320          Max.   :0.366   Max.   : 0.284  
##  NA's   :29084           NA's   :29084   NA's   :29084   
##  ProsperRating.num ProsperRating.alpha  ProsperScore   
##  Min.   :1.000     D      : 5869       Min.   : 1.000  
##  1st Qu.:2.000     E      : 3830       1st Qu.: 5.000  
##  Median :3.000     C      : 3817       Median : 6.000  
##  Mean   :3.663     HR     : 3725       Mean   : 6.266  
##  3rd Qu.:5.000     A      : 3608       3rd Qu.: 8.000  
##  Max.   :7.000     (Other): 5156       Max.   :11.000  
##  NA's   :29084     NA's   :29084       NA's   :29084   
##  ListingCategory.num BorrowerState                      Occupation   
##  1      :17868       CA     : 7263   Other                   :14056  
##  0      :16952       FL     : 3078   Professional            : 6515  
##  7      : 6042       IL     : 3039   Computer Programmer     : 2494  
##  3      : 4257       GA     : 2783   Administrative Assistant: 1934  
##  2      : 3244       TX     : 2752   Sales - Commission      : 1809  
##  4      : 2395       (Other):30659   (Other)                 :26013  
##  (Other): 4331       NA's   : 5515   NA's                    : 2268  
##       EmploymentStatus EmploymentStatusDuration IsBorrowerHomeowner
##  Full-time    :24958   Min.   :  0.00           Mode :logical      
##  Employed     :16491   1st Qu.: 21.00           FALSE:29202        
##  Not available: 5347   Median : 52.00           TRUE :25887        
##  Self-employed: 2926   Mean   : 80.89                              
##  Part-time    : 1056   3rd Qu.:112.00                              
##  (Other)      : 2056   Max.   :755.00                              
##  NA's         : 2255   NA's   :7615                                
##  CurrentlyInGroup                    GroupKey     DateCreditPulled    
##  Mode :logical    783C3371218786870A73D20: 1061   Min.   :2005-11-09  
##  FALSE:43245      3D4D3366260257624AB272D:  806   1st Qu.:2007-08-06  
##  TRUE :11844      6A3B336601725506917317E:  672   Median :2008-08-20  
##                   FEF83377364176536637E50:  582   Mean   :2009-07-14  
##                   C9643379247860156A00EC0:  342   3rd Qu.:2011-08-29  
##                   (Other)                : 9206   Max.   :2014-02-13  
##                   NA's                   :42420                       
##  CreditScoreRangeLower CreditScoreRangeUpper FirstRecordedCreditLine
##  Min.   :  0.0         Min.   : 19.0         Min.   :1947-08-24     
##  1st Qu.:640.0         1st Qu.:659.0         1st Qu.:1990-09-28     
##  Median :680.0         Median :699.0         Median :1995-10-10     
##  Mean   :671.7         Mean   :690.7         Mean   :1994-12-01     
##  3rd Qu.:720.0         3rd Qu.:739.0         3rd Qu.:2000-01-01     
##  Max.   :880.0         Max.   :899.0         Max.   :2012-06-19     
##  NA's   :591           NA's   :591           NA's   :697            
##  CurrentCreditLines OpenCreditLines  TotalCreditLinespast7years
##  Min.   : 0.000     Min.   : 0.000   Min.   :  2.00            
##  1st Qu.: 6.000     1st Qu.: 5.000   1st Qu.: 15.00            
##  Median : 9.000     Median : 8.000   Median : 23.00            
##  Mean   : 9.569     Mean   : 8.338   Mean   : 25.28            
##  3rd Qu.:13.000     3rd Qu.:11.000   3rd Qu.: 33.00            
##  Max.   :59.000     Max.   :51.000   Max.   :136.00            
##  NA's   :7604       NA's   :7604     NA's   :697               
##  OpenRevolvingAccounts OpenRevolvingMonthlyPayment InquiriesLast6Months
##  Min.   : 0.000        Min.   :    0.0             Min.   :  0.000     
##  1st Qu.: 3.000        1st Qu.:   60.0             1st Qu.:  0.000     
##  Median : 5.000        Median :  183.0             Median :  1.000     
##  Mean   : 6.078        Mean   :  325.3             Mean   :  2.052     
##  3rd Qu.: 8.000        3rd Qu.:  418.0             3rd Qu.:  3.000     
##  Max.   :51.000        Max.   :14985.0             Max.   :105.000     
##                                                    NA's   :697         
##  TotalInquiries    CurrentDelinquencies AmountDelinquent
##  Min.   :  0.000   Min.   : 0.0000      Min.   :     0  
##  1st Qu.:  2.000   1st Qu.: 0.0000      1st Qu.:     0  
##  Median :  5.000   Median : 0.0000      Median :     0  
##  Mean   :  7.167   Mean   : 0.9064      Mean   :  1051  
##  3rd Qu.:  9.000   3rd Qu.: 1.0000      3rd Qu.:     0  
##  Max.   :379.000   Max.   :83.0000      Max.   :444745  
##  NA's   :1159      NA's   :697          NA's   :7622    
##  DelinquenciesLast7Years PublicRecordsLast10Years
##  Min.   : 0.000          Min.   : 0.0000         
##  1st Qu.: 0.000          1st Qu.: 0.0000         
##  Median : 0.000          Median : 0.0000         
##  Mean   : 4.572          Mean   : 0.3311         
##  3rd Qu.: 4.000          3rd Qu.: 0.0000         
##  Max.   :99.000          Max.   :30.0000         
##  NA's   :990             NA's   :697             
##  PublicRecordsLast12Months RevolvingCreditBalance BankcardUtilization
##  Min.   :0.000             Min.   :      0        Min.   :0.00       
##  1st Qu.:0.000             1st Qu.:   1614        1st Qu.:0.21       
##  Median :0.000             Median :   6073        Median :0.56       
##  Mean   :0.024             Mean   :  15689        Mean   :0.53       
##  3rd Qu.:0.000             3rd Qu.:  16169        3rd Qu.:0.85       
##  Max.   :7.000             Max.   :1435667        Max.   :5.95       
##  NA's   :7604              NA's   :7604           NA's   :7604       
##  AvailableBankcardCredit  TotalTrades     TradesNeverDelinquent.per
##  Min.   :     0          Min.   :  0.00   Min.   :0.000            
##  1st Qu.:   506          1st Qu.: 13.00   1st Qu.:0.770            
##  Median :  3246          Median : 20.00   Median :0.920            
##  Mean   : 10844          Mean   : 21.79   Mean   :0.856            
##  3rd Qu.: 12046          3rd Qu.: 29.00   3rd Qu.:1.000            
##  Max.   :646285          Max.   :126.00   Max.   :1.000            
##  NA's   :7544            NA's   :7544     NA's   :7544             
##  TradesOpenedLast6Months DebtToIncomeRatio         IncomeRange   
##  Min.   : 0.000          Min.   : 0.00     $25,000-49,999:16344  
##  1st Qu.: 0.000          1st Qu.: 0.13     $50,000-74,999:12789  
##  Median : 1.000          Median : 0.20     Not displayed : 7741  
##  Mean   : 0.909          Mean   : 0.29     $75,000-99,999: 6442  
##  3rd Qu.: 1.000          3rd Qu.: 0.30     $100,000+     : 6064  
##  Max.   :20.000          Max.   :10.01     $1-24,999     : 4571  
##  NA's   :7544            NA's   :4230      (Other)       : 1138  
##  IncomeVerifiable StatedMonthlyIncome                    LoanKey     
##  Mode :logical    Min.   :     0      08C43696561586194AC381C:    2  
##  FALSE:4301       1st Qu.:  2809      09303699897852595CD59DD:    2  
##  TRUE :50788      Median :  4167      114D37056655628721BD6C8:    2  
##                   Mean   :  5054      156836977849742636AE34F:    2  
##                   3rd Qu.:  6250      56D73700259224545E36FBC:    2  
##                   Max.   :618548      63113695530739927C7EA06:    2  
##                                       (Other)                :55077  
##  TotalProsperLoans TotalProsperPaymentsBilled OnTimeProsperPayments
##  Min.   :0.00      Min.   :  0.00             Min.   :  0.00       
##  1st Qu.:1.00      1st Qu.:  9.00             1st Qu.:  9.00       
##  Median :1.00      Median : 15.00             Median : 14.00       
##  Mean   :1.33      Mean   : 20.08             Mean   : 19.49       
##  3rd Qu.:1.00      3rd Qu.: 28.00             3rd Qu.: 27.00       
##  Max.   :7.00      Max.   :120.00             Max.   :114.00       
##  NA's   :44550     NA's   :44550              NA's   :44550        
##  ProsperPaymentsLessThanOneMonthLate ProsperPaymentsOneMonthPlusLate
##  Min.   : 0.00                       Min.   : 0.00                  
##  1st Qu.: 0.00                       1st Qu.: 0.00                  
##  Median : 0.00                       Median : 0.00                  
##  Mean   : 0.54                       Mean   : 0.05                  
##  3rd Qu.: 0.00                       3rd Qu.: 0.00                  
##  Max.   :42.00                       Max.   :21.00                  
##  NA's   :44550                       NA's   :44550                  
##  ProsperPrincipalBorrowed ProsperPrincipalOutstanding
##  Min.   :    0            Min.   :    0              
##  1st Qu.: 3000            1st Qu.:    0              
##  Median : 5000            Median : 1098              
##  Mean   : 7105            Mean   : 2333              
##  3rd Qu.: 9500            3rd Qu.: 3383              
##  Max.   :60001            Max.   :22587              
##  NA's   :44550            NA's   :44550              
##  ScorexChangeAtTimeOfListing LoanCurrentDaysDelinquent
##  Min.   :-194.00             Min.   :   0.0           
##  1st Qu.: -32.00             1st Qu.:   0.0           
##  Median :   0.00             Median :   0.0           
##  Mean   :   1.15             Mean   : 314.5           
##  3rd Qu.:  32.00             3rd Qu.: 228.0           
##  Max.   : 286.00             Max.   :2704.0           
##  NA's   :44647                                        
##  LoanFirstDefaultedCycleNumber LoanMonthsSinceOrigination   LoanNumber    
##  Min.   : 0.00                 Min.   :  1.00             Min.   :     1  
##  1st Qu.: 9.00                 1st Qu.: 30.00             1st Qu.: 18295  
##  Median :14.00                 Median : 66.00             Median : 36353  
##  Mean   :16.27                 Mean   : 55.63             Mean   : 37935  
##  3rd Qu.:22.00                 3rd Qu.: 79.00             3rd Qu.: 53169  
##  Max.   :44.00                 Max.   :100.00             Max.   :132453  
##  NA's   :38145                                                            
##  LoanOriginalAmount LoanOriginationDate  LoanOriginationQuarter
##  Min.   : 1000      Min.   :2005-11-15   Q2 2008: 4344         
##  1st Qu.: 2600      1st Qu.:2007-08-24   Q3 2008: 3602         
##  Median : 4500      Median :2008-09-05   Q2 2007: 3118         
##  Mean   : 6261      Mean   :2009-07-28   Q1 2007: 3079         
##  3rd Qu.: 8000      3rd Qu.:2011-09-13   Q1 2008: 3074         
##  Max.   :35000      Max.   :2014-02-21   (Other):37850         
##                                          NA's   :   22         
##                    MemberKey     MonthlyLoanPayment LP_CustomerPayments
##  16083364744933457E57FB9:    8   Min.   :   0.00    Min.   :   -2.35   
##  63CA34120866140639431C9:    8   1st Qu.:  98.29    1st Qu.: 2029.68   
##  739C338135235294782AE75:    8   Median : 172.60    Median : 4208.27   
##  7E1733653050264822FAA3D:    8   Mean   : 223.72    Mean   : 5927.15   
##  C70934206057523078260C7:    8   3rd Qu.: 300.43    3rd Qu.: 7935.81   
##  458E33818543661332BC1BE:    7   Max.   :2251.51    Max.   :40702.39   
##  (Other)                :55042                                         
##  LP_CustomerPrincipalPayments LP_InterestandFees LP_ServiceFees   
##  Min.   :    0                Min.   :   -2.35   Min.   :-664.87  
##  1st Qu.: 1355                1st Qu.:  331.07   1st Qu.: -73.02  
##  Median : 3150                Median :  763.98   Median : -34.25  
##  Mean   : 4800                Mean   : 1126.98   Mean   : -53.66  
##  3rd Qu.: 6500                3rd Qu.: 1509.52   3rd Qu.: -14.65  
##  Max.   :35000                Max.   :15617.03   Max.   :  32.06  
##                                                                   
##  LP_CollectionFees LP_GrossPrincipalLoss LP_NetPrincipalLoss
##  Min.   :-9275     Min.   :  -94.2       Min.   : -954.5    
##  1st Qu.:    0     1st Qu.:    0.0       1st Qu.:    0.0    
##  Median :    0     Median :    0.0       Median :    0.0    
##  Mean   :  -25     Mean   : 1448.7       Mean   : 1409.3    
##  3rd Qu.:    0     3rd Qu.: 1430.9       3rd Qu.: 1297.9    
##  Max.   :    0     Max.   :25000.0       Max.   :25000.0    
##                                                             
##  LP_NonPrincipalRecoverypayments PercentFunded    Recommendations   
##  Min.   :    0                   Min.   :0.7000   Min.   : 0.00000  
##  1st Qu.:    0                   1st Qu.:1.0000   1st Qu.: 0.00000  
##  Median :    0                   Median :1.0000   Median : 0.00000  
##  Mean   :   52                   Mean   :0.9986   Mean   : 0.08938  
##  3rd Qu.:    0                   3rd Qu.:1.0000   3rd Qu.: 0.00000  
##  Max.   :21118                   Max.   :1.0110   Max.   :39.00000  
##                                                                     
##  InvestmentFromFriendsCount InvestmentFromFriendsAmount   Investors     
##  Min.   : 0.00000           Min.   :    0.00            Min.   :   1.0  
##  1st Qu.: 0.00000           1st Qu.:    0.00            1st Qu.:  31.0  
##  Median : 0.00000           Median :    0.00            Median :  69.0  
##  Mean   : 0.04611           Mean   :   33.59            Mean   : 104.9  
##  3rd Qu.: 0.00000           3rd Qu.:    0.00            3rd Qu.: 142.0  
##  Max.   :33.00000           Max.   :25000.00            Max.   :1189.0  
##                                                                         
##   PercentYield      Completed      
##  Min.   :-1.00092   Mode :logical  
##  1st Qu.:-0.16895   FALSE:17015    
##  Median : 0.10328   TRUE :38074    
##  Mean   :-0.02445                  
##  3rd Qu.: 0.22522                  
##  Max.   : 1.72976                  
## 

Everything seems to check out with the PercentYield calculation.

summary(filter(relevant_data, PercentYield > 1))
##                    ListingKey ListingNumber    ListingCreationDate 
##  086C3431914095124482AF4:1    Min.   : 26634   Min.   :2006-07-25  
##  26C03463657651689E79CAB:1    1st Qu.:257558   1st Qu.:2007-12-25  
##  2A97342295011203308BCFB:1    Median :350085   Median :2008-06-13  
##  C3383423078575403A751E5:1    Mean   :298617   Mean   :2008-04-13  
##  D21E3365863982309A3ECAC:1    3rd Qu.:386123   3rd Qu.:2008-08-20  
##  F1A6342598834278592579C:1    Max.   :426236   Max.   :2009-09-29  
##  (Other)                :1                                         
##   CreditGrade      Term                     LoanStatus
##  HR     :4    Min.   :36   Defaulted             :4   
##  E      :2    1st Qu.:36   Completed             :2   
##  NC     :0    Median :36   Chargedoff            :1   
##  D      :0    Mean   :36   Cancelled             :0   
##  C      :0    3rd Qu.:36   Current               :0   
##  (Other):0    Max.   :36   FinalPaymentInProgress:0   
##  NA's   :1                 (Other)               :0   
##    ClosedDate          BorrowerAPR      BorrowerRate     LenderYield    
##  Min.   :2008-01-19   Min.   :0.2691   Min.   :0.2500   Min.   :0.2450  
##  1st Qu.:2008-12-24   1st Qu.:0.3218   1st Qu.:0.3000   1st Qu.:0.2900  
##  Median :2011-10-23   Median :0.3745   Median :0.3500   Median :0.3400  
##  Mean   :2010-09-11   Mean   :0.3468   Mean   :0.3214   Mean   :0.3121  
##  3rd Qu.:2011-12-31   3rd Qu.:0.3745   3rd Qu.:0.3500   3rd Qu.:0.3400  
##  Max.   :2013-01-18   Max.   :0.3915   Max.   :0.3500   Max.   :0.3400  
##                                                                         
##  EstimatedEffectiveYield EstimatedLoss  EstimatedReturn  ProsperRating.num
##  Min.   :0.1299          Min.   :0.19   Min.   :0.1299   Min.   :1        
##  1st Qu.:0.1299          1st Qu.:0.19   1st Qu.:0.1299   1st Qu.:1        
##  Median :0.1299          Median :0.19   Median :0.1299   Median :1        
##  Mean   :0.1299          Mean   :0.19   Mean   :0.1299   Mean   :1        
##  3rd Qu.:0.1299          3rd Qu.:0.19   3rd Qu.:0.1299   3rd Qu.:1        
##  Max.   :0.1299          Max.   :0.19   Max.   :0.1299   Max.   :1        
##  NA's   :6               NA's   :6      NA's   :6        NA's   :6        
##  ProsperRating.alpha  ProsperScore ListingCategory.num BorrowerState
##  HR     :1           Min.   :5     0      :2           PA     :2    
##  NC     :0           1st Qu.:5     1      :1           MD     :1    
##  E      :0           Median :5     2      :1           OR     :1    
##  D      :0           Mean   :5     4      :1           RI     :1    
##  C      :0           3rd Qu.:5     6      :1           TX     :1    
##  (Other):0           Max.   :5     7      :1           VA     :1    
##  NA's   :6           NA's   :6     (Other):0           (Other):0    
##                Occupation      EmploymentStatus EmploymentStatusDuration
##  Clerical           :2    Full-time    :6       Min.   :10.00           
##  Computer Programmer:2    Not available:1       1st Qu.:11.25           
##  Professional       :1    Employed     :0       Median :18.00           
##  Skilled Labor      :1    Not employed :0       Mean   :30.50           
##  Teacher            :1    Other        :0       3rd Qu.:27.00           
##  Accountant/CPA     :0    Part-time    :0       Max.   :98.00           
##  (Other)            :0    (Other)      :0       NA's   :1               
##  IsBorrowerHomeowner CurrentlyInGroup                    GroupKey
##  Mode :logical       Mode :logical    00343376901312423168731:0  
##  FALSE:4             FALSE:7          00943382969547936B0C529:0  
##  TRUE :3                              00AE3392027644405556335:0  
##                                       016833805323396548B2370:0  
##                                       01A133661136027706728BE:0  
##                                       (Other)                :0  
##                                       NA's                   :7  
##  DateCreditPulled     CreditScoreRangeLower CreditScoreRangeUpper
##  Min.   :2006-07-20   Min.   :520.0         Min.   :539.0        
##  1st Qu.:2007-12-01   1st Qu.:520.0         1st Qu.:539.0        
##  Median :2008-06-13   Median :540.0         Median :559.0        
##  Mean   :2008-03-28   Mean   :548.6         Mean   :567.6        
##  3rd Qu.:2008-07-30   3rd Qu.:560.0         3rd Qu.:579.0        
##  Max.   :2009-09-09   Max.   :620.0         Max.   :639.0        
##                                                                  
##  FirstRecordedCreditLine CurrentCreditLines OpenCreditLines 
##  Min.   :1990-11-20      Min.   : 1.000     Min.   : 2.000  
##  1st Qu.:1995-08-30      1st Qu.: 3.250     1st Qu.: 3.250  
##  Median :1996-09-11      Median : 7.500     Median : 4.500  
##  Mean   :1996-12-02      Mean   : 6.667     Mean   : 5.333  
##  3rd Qu.:1997-09-28      3rd Qu.: 8.750     3rd Qu.: 5.750  
##  Max.   :2004-01-23      Max.   :13.000     Max.   :12.000  
##                          NA's   :1          NA's   :1       
##  TotalCreditLinespast7years OpenRevolvingAccounts
##  Min.   : 7.00              Min.   : 0.000       
##  1st Qu.:19.00              1st Qu.: 1.500       
##  Median :26.00              Median : 2.000       
##  Mean   :23.71              Mean   : 3.286       
##  3rd Qu.:31.00              3rd Qu.: 3.500       
##  Max.   :33.00              Max.   :11.000       
##                                                  
##  OpenRevolvingMonthlyPayment InquiriesLast6Months TotalInquiries  
##  Min.   :  0.0               Min.   :0            Min.   : 1.000  
##  1st Qu.: 52.5               1st Qu.:1            1st Qu.: 4.500  
##  Median : 69.0               Median :2            Median : 8.000  
##  Mean   :133.1               Mean   :3            Mean   : 9.571  
##  3rd Qu.:151.5               3rd Qu.:4            3rd Qu.:12.000  
##  Max.   :455.0               Max.   :9            Max.   :25.000  
##                                                                   
##  CurrentDelinquencies AmountDelinquent DelinquenciesLast7Years
##  Min.   :0.000        Min.   :   0.0   Min.   : 0.00          
##  1st Qu.:1.000        1st Qu.: 115.5   1st Qu.: 1.00          
##  Median :2.000        Median : 444.0   Median :10.00          
##  Mean   :3.286        Mean   : 468.8   Mean   :10.86          
##  3rd Qu.:5.000        3rd Qu.: 697.5   3rd Qu.:11.00          
##  Max.   :9.000        Max.   :1137.0   Max.   :42.00          
##                       NA's   :1                               
##  PublicRecordsLast10Years PublicRecordsLast12Months RevolvingCreditBalance
##  Min.   :0.0000           Min.   :0                 Min.   :  965         
##  1st Qu.:0.0000           1st Qu.:0                 1st Qu.: 1178         
##  Median :0.0000           Median :0                 Median : 1852         
##  Mean   :0.2857           Mean   :0                 Mean   : 3756         
##  3rd Qu.:0.5000           3rd Qu.:0                 3rd Qu.: 3407         
##  Max.   :1.0000           Max.   :0                 Max.   :12989         
##                           NA's   :1                 NA's   :1             
##  BankcardUtilization AvailableBankcardCredit  TotalTrades   
##  Min.   :0.7200      Min.   :  0.0           Min.   : 5.00  
##  1st Qu.:0.9925      1st Qu.:  0.0           1st Qu.:17.75  
##  Median :1.0400      Median : 40.5           Median :20.50  
##  Mean   :1.0733      Mean   :117.5           Mean   :19.67  
##  3rd Qu.:1.1100      3rd Qu.:124.5           3rd Qu.:24.00  
##  Max.   :1.5300      Max.   :485.0           Max.   :30.00  
##  NA's   :1           NA's   :1               NA's   :1      
##  TradesNeverDelinquent.per TradesOpenedLast6Months DebtToIncomeRatio
##  Min.   :0.2700            Min.   :0.00            Min.   :0.130    
##  1st Qu.:0.5950            1st Qu.:0.00            1st Qu.:0.135    
##  Median :0.6700            Median :0.00            Median :0.160    
##  Mean   :0.6383            Mean   :0.50            Mean   :0.210    
##  3rd Qu.:0.7750            3rd Qu.:0.75            3rd Qu.:0.285    
##  Max.   :0.8400            Max.   :2.00            Max.   :0.340    
##  NA's   :1                 NA's   :1                                
##          IncomeRange IncomeVerifiable StatedMonthlyIncome
##  $25,000-49,999:3    Mode:logical     Min.   :2000       
##  $50,000-74,999:2    TRUE:7           1st Qu.:2125       
##  Not displayed :1                     Median :2730       
##  $75,000-99,999:1                     Mean   :3803       
##  Not employed  :0                     3rd Qu.:5277       
##  $0            :0                     Max.   :7083       
##  (Other)       :0                                        
##                     LoanKey  TotalProsperLoans TotalProsperPaymentsBilled
##  3BD634237789212566E5A34:1   Min.   :1         Min.   :12                
##  47A7336630616348408D321:1   1st Qu.:1         1st Qu.:12                
##  5CC5343201507741750BCD4:1   Median :1         Median :12                
##  5F1835683805778648B4FA5:1   Mean   :1         Mean   :12                
##  6BC834233584328164F4A6B:1   3rd Qu.:1         3rd Qu.:12                
##  70EA33942910132621878E3:1   Max.   :1         Max.   :12                
##  (Other)                :1   NA's   :6         NA's   :6                 
##  OnTimeProsperPayments ProsperPaymentsLessThanOneMonthLate
##  Min.   :12            Min.   :0                          
##  1st Qu.:12            1st Qu.:0                          
##  Median :12            Median :0                          
##  Mean   :12            Mean   :0                          
##  3rd Qu.:12            3rd Qu.:0                          
##  Max.   :12            Max.   :0                          
##  NA's   :6             NA's   :6                          
##  ProsperPaymentsOneMonthPlusLate ProsperPrincipalBorrowed
##  Min.   :0                       Min.   :2000            
##  1st Qu.:0                       1st Qu.:2000            
##  Median :0                       Median :2000            
##  Mean   :0                       Mean   :2000            
##  3rd Qu.:0                       3rd Qu.:2000            
##  Max.   :0                       Max.   :2000            
##  NA's   :6                       NA's   :6               
##  ProsperPrincipalOutstanding ScorexChangeAtTimeOfListing
##  Min.   :1546                Min.   :1                  
##  1st Qu.:1546                1st Qu.:1                  
##  Median :1546                Median :1                  
##  Mean   :1546                Mean   :1                  
##  3rd Qu.:1546                3rd Qu.:1                  
##  Max.   :1546                Max.   :1                  
##  NA's   :6                   NA's   :6                  
##  LoanCurrentDaysDelinquent LoanFirstDefaultedCycleNumber
##  Min.   :   0.0            Min.   : 1.0                 
##  1st Qu.: 132.0            1st Qu.: 6.0                 
##  Median : 271.0            Median :14.0                 
##  Mean   : 565.4            Mean   :16.4                 
##  3rd Qu.:1017.0            3rd Qu.:21.0                 
##  Max.   :1389.0            Max.   :40.0                 
##                            NA's   :2                    
##  LoanMonthsSinceOrigination   LoanNumber    LoanOriginalAmount
##  Min.   :53.00              Min.   : 2265   Min.   :1000      
##  1st Qu.:66.50              1st Qu.:25008   1st Qu.:1000      
##  Median :69.00              Median :33200   Median :1000      
##  Mean   :70.71              Mean   :28076   Mean   :1007      
##  3rd Qu.:74.50              3rd Qu.:36071   3rd Qu.:1000      
##  Max.   :91.00              Max.   :38907   Max.   :1050      
##                                                               
##  LoanOriginationDate  LoanOriginationQuarter                   MemberKey
##  Min.   :2006-08-02   Q2 2008:2              07553392568434371B69BEA:1  
##  1st Qu.:2008-01-02   Q3 2008:2              0E6E3429920780746133A4B:1  
##  Median :2008-06-23   Q3 2006:1              2F503420860619469E1B2B2:1  
##  Mean   :2008-04-24   Q3 2007:1              6B093422578451456F5E359:1  
##  3rd Qu.:2008-09-01   Q4 2009:1              90713365131318276755B6C:1  
##  Max.   :2009-10-15   Q1 2006:0              DB59342339253365288B87F:1  
##                       (Other):0              (Other)                :1  
##  MonthlyLoanPayment LP_CustomerPayments LP_CustomerPrincipalPayments
##  Min.   : 0.00      Min.   :  41.91     Min.   :   0.0              
##  1st Qu.: 0.00      1st Qu.: 476.66     1st Qu.: 180.3              
##  Median : 0.00      Median :1794.40     Median : 712.8              
##  Mean   :16.34      Mean   :1331.67     Mean   : 547.6              
##  3rd Qu.:34.58      3rd Qu.:2049.74     3rd Qu.: 879.8              
##  Max.   :45.24      Max.   :2432.58     Max.   :1000.0              
##                                                                     
##  LP_InterestandFees LP_ServiceFees    LP_CollectionFees 
##  Min.   :  24.63    Min.   :-26.710   Min.   :-161.500  
##  1st Qu.: 305.04    1st Qu.:-24.400   1st Qu.: -70.325  
##  Median :1034.88    Median : -9.530   Median : -45.160  
##  Mean   : 784.12    Mean   :-14.334   Mean   : -51.132  
##  3rd Qu.:1193.34    3rd Qu.: -7.225   3rd Qu.:  -5.306  
##  Max.   :1432.58    Max.   : -0.850   Max.   :   0.000  
##                                                         
##  LP_GrossPrincipalLoss LP_NetPrincipalLoss LP_NonPrincipalRecoverypayments
##  Min.   :   0.0        Min.   :-33.660     Min.   :   0.00                
##  1st Qu.: 137.1        1st Qu.:  0.000     1st Qu.:  13.04                
##  Median : 337.2        Median :  0.000     Median :  29.79                
##  Mean   : 464.4        Mean   : -3.986     Mean   : 571.45                
##  3rd Qu.: 819.7        3rd Qu.:  0.000     3rd Qu.:1119.15                
##  Max.   :1000.0        Max.   :  5.760     Max.   :1705.98                
##                                                                           
##  PercentFunded Recommendations InvestmentFromFriendsCount
##  Min.   :1     Min.   :0       Min.   :0                 
##  1st Qu.:1     1st Qu.:0       1st Qu.:0                 
##  Median :1     Median :0       Median :0                 
##  Mean   :1     Mean   :0       Mean   :0                 
##  3rd Qu.:1     3rd Qu.:0       3rd Qu.:0                 
##  Max.   :1     Max.   :0       Max.   :0                 
##                                                          
##  InvestmentFromFriendsAmount   Investors      PercentYield  
##  Min.   :0                   Min.   : 1.00   Min.   :1.004  
##  1st Qu.:0                   1st Qu.: 6.50   1st Qu.:1.116  
##  Median :0                   Median :13.00   Median :1.189  
##  Mean   :0                   Mean   :11.71   Mean   :1.287  
##  3rd Qu.:0                   3rd Qu.:15.50   3rd Qu.:1.426  
##  Max.   :0                   Max.   :24.00   Max.   :1.730  
## 
summary(filter(relevant_data, PercentYield < -1))
##                    ListingKey ListingNumber    ListingCreationDate 
##  34CD3587284005601C13ED3:1    Min.   :568875   Min.   :2012-03-14  
##  4CBB3541533232375D4CEA5:1    1st Qu.:655721   1st Qu.:2012-10-01  
##  97A13588293719997E19C94:1    Median :781791   Median :2013-04-16  
##  C0D935652403806997CFF5E:1    Mean   :753201   Mean   :2013-02-08  
##  00003546482094282EF90E5:0    3rd Qu.:879270   3rd Qu.:2013-08-24  
##  00013542762124763F20254:0    Max.   :880346   Max.   :2013-08-26  
##  (Other)                :0                                         
##   CreditGrade      Term                     LoanStatus
##  NC     :0    Min.   :36   Chargedoff            :4   
##  HR     :0    1st Qu.:36   Cancelled             :0   
##  E      :0    Median :36   Completed             :0   
##  D      :0    Mean   :42   Current               :0   
##  C      :0    3rd Qu.:42   Defaulted             :0   
##  (Other):0    Max.   :60   FinalPaymentInProgress:0   
##  NA's   :4                 (Other)               :0   
##    ClosedDate          BorrowerAPR      BorrowerRate     LenderYield    
##  Min.   :2012-08-23   Min.   :0.1370   Min.   :0.1089   Min.   :0.0989  
##  1st Qu.:2013-03-19   1st Qu.:0.2044   1st Qu.:0.1790   1st Qu.:0.1690  
##  Median :2013-09-27   Median :0.2762   Median :0.2442   Median :0.2341  
##  Mean   :2013-07-20   Mean   :0.2618   Mean   :0.2287   Mean   :0.2187  
##  3rd Qu.:2014-01-28   3rd Qu.:0.3335   3rd Qu.:0.2939   3rd Qu.:0.2838  
##  Max.   :2014-01-28   Max.   :0.3580   Max.   :0.3177   Max.   :0.3077  
##                                                                         
##  EstimatedEffectiveYield EstimatedLoss    EstimatedReturn 
##  Min.   :0.0956          Min.   :0.0299   Min.   :0.0657  
##  1st Qu.:0.1589          1st Qu.:0.0599   1st Qu.:0.0990  
##  Median :0.2156          Median :0.1037   Median :0.1119  
##  Mean   :0.2041          Mean   :0.1006   Mean   :0.1035  
##  3rd Qu.:0.2608          3rd Qu.:0.1444   3rd Qu.:0.1164  
##  Max.   :0.2896          Max.   :0.1650   Max.   :0.1246  
##                                                           
##  ProsperRating.num ProsperRating.alpha  ProsperScore  ListingCategory.num
##  Min.   :1.00      HR     :1           Min.   :3.00   1      :2          
##  1st Qu.:1.75      E      :1           1st Qu.:3.00   7      :1          
##  Median :3.00      C      :1           Median :4.00   18     :1          
##  Mean   :3.25      A      :1           Mean   :4.75   0      :0          
##  3rd Qu.:4.50      NC     :0           3rd Qu.:5.75   2      :0          
##  Max.   :6.00      D      :0           Max.   :8.00   3      :0          
##                    (Other):0                          (Other):0          
##  BorrowerState                    Occupation      EmploymentStatus
##  CA     :1     Administrative Assistant:1    Employed     :4      
##  PA     :1     Computer Programmer     :1    Full-time    :0      
##  VA     :1     Executive               :1    Not available:0      
##  WA     :1     Truck Driver            :1    Not employed :0      
##  AK     :0     Accountant/CPA          :0    Other        :0      
##  AL     :0     Analyst                 :0    Part-time    :0      
##  (Other):0     (Other)                 :0    (Other)      :0      
##  EmploymentStatusDuration IsBorrowerHomeowner CurrentlyInGroup
##  Min.   :  2.00           Mode :logical       Mode :logical   
##  1st Qu.: 26.75           FALSE:1             FALSE:4         
##  Median : 39.00           TRUE :3                             
##  Mean   : 60.25                                               
##  3rd Qu.: 72.50                                               
##  Max.   :161.00                                               
##                                                               
##                     GroupKey DateCreditPulled     CreditScoreRangeLower
##  00343376901312423168731:0   Min.   :2012-03-14   Min.   :660          
##  00943382969547936B0C529:0   1st Qu.:2012-10-01   1st Qu.:660          
##  00AE3392027644405556335:0   Median :2013-04-16   Median :710          
##  016833805323396548B2370:0   Mean   :2013-02-08   Mean   :715          
##  01A133661136027706728BE:0   3rd Qu.:2013-08-24   3rd Qu.:765          
##  (Other)                :0   Max.   :2013-08-26   Max.   :780          
##  NA's                   :4                                             
##  CreditScoreRangeUpper FirstRecordedCreditLine CurrentCreditLines
##  Min.   :679           Min.   :1980-05-31      Min.   : 0.00     
##  1st Qu.:679           1st Qu.:1985-10-09      1st Qu.: 4.50     
##  Median :729           Median :1991-08-23      Median :10.00     
##  Mean   :734           Mean   :1992-03-14      Mean   : 8.75     
##  3rd Qu.:784           3rd Qu.:1998-01-26      3rd Qu.:14.25     
##  Max.   :799           Max.   :2005-02-10      Max.   :15.00     
##                                                                  
##  OpenCreditLines TotalCreditLinespast7years OpenRevolvingAccounts
##  Min.   : 0.00   Min.   :12.00              Min.   : 0.00        
##  1st Qu.: 4.50   1st Qu.:14.25              1st Qu.: 0.75        
##  Median : 9.50   Median :19.00              Median : 5.50        
##  Mean   : 8.25   Mean   :20.50              Mean   : 5.50        
##  3rd Qu.:13.25   3rd Qu.:25.25              3rd Qu.:10.25        
##  Max.   :14.00   Max.   :32.00              Max.   :11.00        
##                                                                  
##  OpenRevolvingMonthlyPayment InquiriesLast6Months TotalInquiries
##  Min.   :   0.00             Min.   :0.00         Min.   :1.00  
##  1st Qu.:  18.75             1st Qu.:0.75         1st Qu.:1.00  
##  Median : 716.50             Median :1.00         Median :1.50  
##  Mean   : 949.00             Mean   :1.00         Mean   :2.75  
##  3rd Qu.:1646.75             3rd Qu.:1.25         3rd Qu.:3.25  
##  Max.   :2363.00             Max.   :2.00         Max.   :7.00  
##                                                                 
##  CurrentDelinquencies AmountDelinquent DelinquenciesLast7Years
##  Min.   :0.00         Min.   :  0.00   Min.   : 0.00          
##  1st Qu.:0.00         1st Qu.:  0.00   1st Qu.: 0.00          
##  Median :0.00         Median :  0.00   Median : 2.50          
##  Mean   :0.25         Mean   : 51.75   Mean   : 8.25          
##  3rd Qu.:0.25         3rd Qu.: 51.75   3rd Qu.:10.75          
##  Max.   :1.00         Max.   :207.00   Max.   :28.00          
##                                                               
##  PublicRecordsLast10Years PublicRecordsLast12Months RevolvingCreditBalance
##  Min.   :0.00             Min.   :0                 Min.   :    0.0       
##  1st Qu.:0.00             1st Qu.:0                 1st Qu.:  188.2       
##  Median :0.00             Median :0                 Median :10729.0       
##  Mean   :0.25             Mean   :0                 Mean   :28864.5       
##  3rd Qu.:0.25             3rd Qu.:0                 3rd Qu.:39405.2       
##  Max.   :1.00             Max.   :0                 Max.   :94000.0       
##                                                                           
##  BankcardUtilization AvailableBankcardCredit  TotalTrades   
##  Min.   :0.0000      Min.   :    0.00        Min.   :11.00  
##  1st Qu.:0.3825      1st Qu.:   36.75        1st Qu.:11.75  
##  Median :0.6700      Median :  262.00        Median :17.00  
##  Mean   :0.5850      Mean   : 5327.50        Mean   :18.25  
##  3rd Qu.:0.8725      3rd Qu.: 5552.75        3rd Qu.:23.50  
##  Max.   :1.0000      Max.   :20786.00        Max.   :28.00  
##                                                             
##  TradesNeverDelinquent.per TradesOpenedLast6Months DebtToIncomeRatio
##  Min.   :0.380             Min.   :0               Min.   :0.0400   
##  1st Qu.:0.635             1st Qu.:0               1st Qu.:0.1000   
##  Median :0.840             Median :0               Median :0.1500   
##  Mean   :0.765             Mean   :0               Mean   :0.1325   
##  3rd Qu.:0.970             3rd Qu.:0               3rd Qu.:0.1825   
##  Max.   :1.000             Max.   :0               Max.   :0.1900   
##                                                                     
##          IncomeRange IncomeVerifiable StatedMonthlyIncome
##  $25,000-49,999:2    Mode:logical     Min.   : 2333      
##  $100,000+     :2    TRUE:4           1st Qu.: 2833      
##  Not displayed :0                     Median : 9000      
##  Not employed  :0                     Mean   :10188      
##  $0            :0                     3rd Qu.:16354      
##  $1-24,999     :0                     Max.   :20417      
##  (Other)       :0                                        
##                     LoanKey  TotalProsperLoans TotalProsperPaymentsBilled
##  1724369081908936857F8EF:1   Min.   : NA       Min.   : NA               
##  74FF36470534073180AA9D0:1   1st Qu.: NA       1st Qu.: NA               
##  9D24367018687929200F8B0:1   Median : NA       Median : NA               
##  E8EC36918514705070CF2EE:1   Mean   :NaN       Mean   :NaN               
##  00003683605746079487FF7:0   3rd Qu.: NA       3rd Qu.: NA               
##  00013421083473792D70F75:0   Max.   : NA       Max.   : NA               
##  (Other)                :0   NA's   :4         NA's   :4                 
##  OnTimeProsperPayments ProsperPaymentsLessThanOneMonthLate
##  Min.   : NA           Min.   : NA                        
##  1st Qu.: NA           1st Qu.: NA                        
##  Median : NA           Median : NA                        
##  Mean   :NaN           Mean   :NaN                        
##  3rd Qu.: NA           3rd Qu.: NA                        
##  Max.   : NA           Max.   : NA                        
##  NA's   :4             NA's   :4                          
##  ProsperPaymentsOneMonthPlusLate ProsperPrincipalBorrowed
##  Min.   : NA                     Min.   : NA             
##  1st Qu.: NA                     1st Qu.: NA             
##  Median : NA                     Median : NA             
##  Mean   :NaN                     Mean   :NaN             
##  3rd Qu.: NA                     3rd Qu.: NA             
##  Max.   : NA                     Max.   : NA             
##  NA's   :4                       NA's   :4               
##  ProsperPrincipalOutstanding ScorexChangeAtTimeOfListing
##  Min.   : NA                 Min.   : NA                
##  1st Qu.: NA                 1st Qu.: NA                
##  Median : NA                 Median : NA                
##  Mean   :NaN                 Mean   :NaN                
##  3rd Qu.: NA                 3rd Qu.: NA                
##  Max.   : NA                 Max.   : NA                
##  NA's   :4                   NA's   :4                  
##  LoanCurrentDaysDelinquent LoanFirstDefaultedCycleNumber
##  Min.   :163.0             Min.   :5.00                 
##  1st Qu.:163.0             1st Qu.:5.00                 
##  Median :285.5             Median :5.00                 
##  Mean   :355.0             Mean   :5.25                 
##  3rd Qu.:477.5             3rd Qu.:5.25                 
##  Max.   :686.0             Max.   :6.00                 
##                                                         
##  LoanMonthsSinceOrigination   LoanNumber     LoanOriginalAmount
##  Min.   : 7.00              Min.   : 62368   Min.   : 3000     
##  1st Qu.: 7.00              1st Qu.: 75132   1st Qu.: 3000     
##  Median :11.00              Median : 89780   Median :10439     
##  Mean   :13.25              Mean   : 85586   Mean   :12220     
##  3rd Qu.:17.25              3rd Qu.:100235   3rd Qu.:19658     
##  Max.   :24.00              Max.   :100414   Max.   :25000     
##                                                                
##  LoanOriginationDate  LoanOriginationQuarter                   MemberKey
##  Min.   :2012-03-23   Q3 2013:2              0CE43587499687303B53D9D:1  
##  1st Qu.:2012-10-17   Q1 2012:1              39A5356469782230058076F:1  
##  Median :2013-04-27   Q4 2012:1              49053588028781958EA2262:1  
##  Mean   :2013-02-17   Q1 2006:0              6AB43541952481860D5AA31:1  
##  3rd Qu.:2013-08-28   Q2 2006:0              00003397697413387CAF966:0  
##  Max.   :2013-08-28   Q3 2006:0              000035297015484885C64F8:0  
##                       (Other):0              (Other)                :0  
##  MonthlyLoanPayment LP_CustomerPayments LP_CustomerPrincipalPayments
##  Min.   :125.0      Min.   :-2.3499     Min.   :0                   
##  1st Qu.:129.0      1st Qu.:-0.5875     1st Qu.:0                   
##  Median :355.0      Median : 0.0000     Median :0                   
##  Mean   :375.2      Mean   :-0.5875     Mean   :0                   
##  3rd Qu.:601.2      3rd Qu.: 0.0000     3rd Qu.:0                   
##  Max.   :665.7      Max.   : 0.0000     Max.   :0                   
##                                                                     
##  LP_InterestandFees LP_ServiceFees LP_CollectionFees LP_GrossPrincipalLoss
##  Min.   :-2.3499    Min.   :0      Min.   :-2.360    Min.   : 3000        
##  1st Qu.:-0.5875    1st Qu.:0      1st Qu.:-1.865    1st Qu.: 3000        
##  Median : 0.0000    Median :0      Median :-1.050    Median :10439        
##  Mean   :-0.5875    Mean   :0      Mean   :-1.115    Mean   :12220        
##  3rd Qu.: 0.0000    3rd Qu.:0      3rd Qu.:-0.300    3rd Qu.:19659        
##  Max.   : 0.0000    Max.   :0      Max.   : 0.000    Max.   :25000        
##                                                                           
##  LP_NetPrincipalLoss LP_NonPrincipalRecoverypayments PercentFunded   
##  Min.   : 3000       Min.   :0                       Min.   :0.8721  
##  1st Qu.: 3000       1st Qu.:0                       1st Qu.:0.9680  
##  Median :10439       Median :0                       Median :1.0000  
##  Mean   :12220       Mean   :0                       Mean   :0.9680  
##  3rd Qu.:19659       3rd Qu.:0                       3rd Qu.:1.0000  
##  Max.   :25000       Max.   :0                       Max.   :1.0000  
##                                                                      
##  Recommendations InvestmentFromFriendsCount InvestmentFromFriendsAmount
##  Min.   :0       Min.   :0                  Min.   :0                  
##  1st Qu.:0       1st Qu.:0                  1st Qu.:0                  
##  Median :0       Median :0                  Median :0                  
##  Mean   :0       Mean   :0                  Mean   :0                  
##  3rd Qu.:0       3rd Qu.:0                  3rd Qu.:0                  
##  Max.   :0       Max.   :0                  Max.   :0                  
##                                                                        
##    Investors       PercentYield   
##  Min.   :  1.00   Min.   :-1.001  
##  1st Qu.:  5.50   1st Qu.:-1.001  
##  Median : 38.00   Median :-1.000  
##  Mean   : 65.75   Mean   :-1.000  
##  3rd Qu.: 98.25   3rd Qu.:-1.000  
##  Max.   :186.00   Max.   :-1.000  
## 

For PercentYield, extreme gains seem to be cases of loans being defaulted or charged off, but fully, or almost fully recovered with large interest payments.

Extreme losses are due to loans that were charged off - where no payments were ever made or recovered, but investors lost money on collection fees. Overall, this data appears sensible.

Now I want to take a closer look at the relationship between this measure, and the other primary measure of interest: loan status.

LoanStatus by PercentYield

relevant_data %>% group_by(LoanStatus) %>% ggplot(aes(x = LoanStatus, y = PercentYield)) + geom_boxplot() + labs(title = "PercentYield by LoanStatus", y = "Percent Yield", x = "Loan Status") + scale_y_continuous(labels = percent)

Here it can be seen that cancelled loans do nor result in any money gained or lost by the lender, that lenders earn on average 20% on top of their initial investment for completed loans, and that thy lose about 50% of their original investment, on average, for charged off or defaulted loans, signifying that lenders may be taking a substantial risk, particularly with loans that are likely to be defaulted on or charged off.

Average PercentYield is roughly equal between loans that have been charged off, and those that defaulted, suggesting that the relevant measure here might be simply whether the loan was completed, or not (if one doesn’t want to take into account the possibility that loans defaulted on will still be repaid at some point).

summary(filter(relevant_data, LoanStatus == "Chargedoff"))
##                    ListingKey    ListingNumber    ListingCreationDate 
##  00013542762124763F20254:    1   Min.   :   156   Min.   :2006-02-11  
##  000433785890431972B4743:    1   1st Qu.:188514   1st Qu.:2007-08-16  
##  0005353671687550573289D:    1   Median :369917   Median :2008-07-19  
##  001035373445372274F74E2:    1   Mean   :364865   Mean   :2009-07-14  
##  00143395229257559A91663:    1   3rd Qu.:534036   3rd Qu.:2011-10-19  
##  00153399719267548BE59C1:    1   Max.   :932346   Max.   :2013-09-26  
##  (Other)                :11986                                        
##   CreditGrade        Term                        LoanStatus   
##  D      :1343   Min.   :12.00   Chargedoff            :11992  
##  C      :1310   1st Qu.:36.00   Cancelled             :    0  
##  HR     :1242   Median :36.00   Completed             :    0  
##  E      : 946   Mean   :38.03   Current               :    0  
##  B      : 909   3rd Qu.:36.00   Defaulted             :    0  
##  (Other): 900   Max.   :60.00   FinalPaymentInProgress:    0  
##  NA's   :5342                   (Other)               :    0  
##    ClosedDate          BorrowerAPR       BorrowerRate     LenderYield    
##  Min.   :2006-11-23   Min.   :0.01823   Min.   :0.0100   Min.   :0.0000  
##  1st Qu.:2009-03-11   1st Qu.:0.19003   1st Qu.:0.1769   1st Qu.:0.1650  
##  Median :2010-08-17   Median :0.26271   Median :0.2400   Median :0.2300  
##  Mean   :2011-01-07   Mean   :0.25775   Mean   :0.2354   Mean   :0.2247  
##  3rd Qu.:2013-01-29   3rd Qu.:0.32958   3rd Qu.:0.2975   3rd Qu.:0.2869  
##  Max.   :2014-03-10   Max.   :0.46201   Max.   :0.4500   Max.   :0.4325  
##                                                                          
##  EstimatedEffectiveYield EstimatedLoss   EstimatedReturn 
##  Min.   :-0.182          Min.   :0.006   Min.   :-0.182  
##  1st Qu.: 0.164          1st Qu.:0.087   1st Qu.: 0.111  
##  Median : 0.236          Median :0.112   Median : 0.125  
##  Mean   : 0.218          Mean   :0.116   Mean   : 0.123  
##  3rd Qu.: 0.286          3rd Qu.:0.149   3rd Qu.: 0.144  
##  Max.   : 0.320          Max.   :0.366   Max.   : 0.284  
##  NA's   :6656            NA's   :6656    NA's   :6656    
##  ProsperRating.num ProsperRating.alpha  ProsperScore   
##  Min.   :1.000     D      :1395        Min.   : 1.000  
##  1st Qu.:2.000     HR     :1215        1st Qu.: 4.000  
##  Median :3.000     E      :1131        Median : 5.000  
##  Mean   :2.883     C      : 706        Mean   : 5.391  
##  3rd Qu.:4.000     B      : 500        3rd Qu.: 7.000  
##  Max.   :7.000     (Other): 389        Max.   :10.000  
##  NA's   :6656      NA's   :6656        NA's   :6656    
##  ListingCategory.num BorrowerState                     Occupation  
##  0      :3792        CA     :1574   Other                   :3459  
##  1      :3655        FL     : 761   Professional            :1229  
##  7      :1225        GA     : 688   Sales - Commission      : 485  
##  3      :1108        IL     : 685   Administrative Assistant: 460  
##  2      : 669        TX     : 579   Clerical                : 444  
##  4      : 577        (Other):6665   (Other)                 :5624  
##  (Other): 966        NA's   :1040   NA's                    : 291  
##       EmploymentStatus EmploymentStatusDuration IsBorrowerHomeowner
##  Full-time    :5343    Min.   :  0.00           Mode :logical      
##  Employed     :3529    1st Qu.: 19.00           FALSE:6661         
##  Not available:1065    Median : 51.00           TRUE :5331         
##  Self-employed: 897    Mean   : 80.18                              
##  Other        : 295    3rd Qu.:111.00                              
##  (Other)      : 572    Max.   :755.00                              
##  NA's         : 291    NA's   :1358                                
##  CurrentlyInGroup                    GroupKey    DateCreditPulled    
##  Mode :logical    783C3371218786870A73D20: 291   Min.   :2006-01-29  
##  FALSE:9187       FEF83377364176536637E50: 235   1st Qu.:2007-08-07  
##  TRUE :2805       3D4D3366260257624AB272D: 139   Median :2008-07-11  
##                   6A3B336601725506917317E: 120   Mean   :2009-07-10  
##                   9BBE337094173775621CD34: 119   3rd Qu.:2011-10-19  
##                   (Other)                :1968   Max.   :2013-09-26  
##                   NA's                   :9120                       
##  CreditScoreRangeLower CreditScoreRangeUpper FirstRecordedCreditLine
##  Min.   :  0.0         Min.   : 19.0         Min.   :1955-05-01     
##  1st Qu.:600.0         1st Qu.:619.0         1st Qu.:1990-07-01     
##  Median :660.0         Median :679.0         Median :1995-09-06     
##  Mean   :648.9         Mean   :667.9         Mean   :1994-10-22     
##  3rd Qu.:700.0         3rd Qu.:719.0         3rd Qu.:2000-02-01     
##  Max.   :860.0         Max.   :879.0         Max.   :2011-08-10     
##  NA's   :48            NA's   :48            NA's   :76             
##  CurrentCreditLines OpenCreditLines  TotalCreditLinespast7years
##  Min.   : 0.000     Min.   : 0.000   Min.   :  2.00            
##  1st Qu.: 5.000     1st Qu.: 4.000   1st Qu.: 14.00            
##  Median : 8.000     Median : 7.000   Median : 22.00            
##  Mean   : 8.846     Mean   : 7.728   Mean   : 24.58            
##  3rd Qu.:12.000     3rd Qu.:10.000   3rd Qu.: 33.00            
##  Max.   :48.000     Max.   :43.000   Max.   :129.00            
##  NA's   :1356       NA's   :1356     NA's   :76                
##  OpenRevolvingAccounts OpenRevolvingMonthlyPayment InquiriesLast6Months
##  Min.   : 0.000        Min.   :    0.0             Min.   :  0.000     
##  1st Qu.: 2.000        1st Qu.:   42.0             1st Qu.:  0.000     
##  Median : 5.000        Median :  152.0             Median :  2.000     
##  Mean   : 5.575        Mean   :  308.5             Mean   :  2.767     
##  3rd Qu.: 8.000        3rd Qu.:  383.2             3rd Qu.:  4.000     
##  Max.   :41.000        Max.   :14985.0             Max.   :105.000     
##                                                    NA's   :76          
##  TotalInquiries    CurrentDelinquencies AmountDelinquent
##  Min.   :  0.000   Min.   : 0.000       Min.   :     0  
##  1st Qu.:  3.000   1st Qu.: 0.000       1st Qu.:     0  
##  Median :  6.000   Median : 0.000       Median :     0  
##  Mean   :  8.916   Mean   : 1.384       Mean   :  1493  
##  3rd Qu.: 12.000   3rd Qu.: 1.000       3rd Qu.:   130  
##  Max.   :379.000   Max.   :64.000       Max.   :444745  
##  NA's   :126       NA's   :76           NA's   :1358    
##  DelinquenciesLast7Years PublicRecordsLast10Years
##  Min.   : 0.000          Min.   : 0.0000         
##  1st Qu.: 0.000          1st Qu.: 0.0000         
##  Median : 0.000          Median : 0.0000         
##  Mean   : 6.022          Mean   : 0.4553         
##  3rd Qu.: 7.000          3rd Qu.: 1.0000         
##  Max.   :99.000          Max.   :30.0000         
##  NA's   :120             NA's   :76              
##  PublicRecordsLast12Months RevolvingCreditBalance BankcardUtilization
##  Min.   :0.0000            Min.   :     0.0       Min.   :0.0000     
##  1st Qu.:0.0000            1st Qu.:   892.8       1st Qu.:0.2100     
##  Median :0.0000            Median :  4440.5       Median :0.6000     
##  Mean   :0.0351            Mean   : 14005.8       Mean   :0.5516     
##  3rd Qu.:0.0000            3rd Qu.: 13882.0       3rd Qu.:0.8800     
##  Max.   :7.0000            Max.   :600223.0       Max.   :2.6800     
##  NA's   :1356              NA's   :1356           NA's   :1356       
##  AvailableBankcardCredit  TotalTrades     TradesNeverDelinquent.per
##  Min.   :     0          Min.   :  1.00   Min.   :0.000            
##  1st Qu.:   175          1st Qu.: 11.00   1st Qu.:0.710            
##  Median :  1587          Median : 18.00   Median :0.870            
##  Mean   :  6897          Mean   : 20.44   Mean   :0.815            
##  3rd Qu.:  7196          3rd Qu.: 28.00   3rd Qu.:1.000            
##  Max.   :364284          Max.   :118.00   Max.   :1.000            
##  NA's   :1342            NA's   :1342     NA's   :1342             
##  TradesOpenedLast6Months DebtToIncomeRatio         IncomeRange  
##  Min.   : 0.000          Min.   : 0.0000   $25,000-49,999:4162  
##  1st Qu.: 0.000          1st Qu.: 0.1300   $50,000-74,999:2633  
##  Median : 1.000          Median : 0.2100   Not displayed :1380  
##  Mean   : 1.078          Mean   : 0.3392   $1-24,999     :1329  
##  3rd Qu.: 2.000          3rd Qu.: 0.3300   $75,000-99,999:1153  
##  Max.   :17.000          Max.   :10.0100   $100,000+     : 968  
##  NA's   :1342            NA's   :1241      (Other)       : 367  
##  IncomeVerifiable StatedMonthlyIncome                    LoanKey     
##  Mode :logical    Min.   :     0      00023650503696810C531F7:    1  
##  FALSE:1260       1st Qu.:  2500      0004363753221955965B646:    1  
##  TRUE :10732      Median :  3750      000836579711360490B130B:    1  
##                   Mean   :  4486      000B3656359179267F91999:    1  
##                   3rd Qu.:  5500      001C336540093530548F61A:    1  
##                   Max.   :208333      001E3652350675777DB09A9:    1  
##                                       (Other)                :11986  
##  TotalProsperLoans TotalProsperPaymentsBilled OnTimeProsperPayments
##  Min.   :1.000     Min.   :  0.00             Min.   :  0.00       
##  1st Qu.:1.000     1st Qu.:  9.00             1st Qu.:  8.00       
##  Median :1.000     Median : 12.00             Median : 12.00       
##  Mean   :1.253     Mean   : 18.86             Mean   : 18.03       
##  3rd Qu.:1.000     3rd Qu.: 25.00             3rd Qu.: 24.00       
##  Max.   :7.000     Max.   :103.00             Max.   :101.00       
##  NA's   :10015     NA's   :10015              NA's   :10015        
##  ProsperPaymentsLessThanOneMonthLate ProsperPaymentsOneMonthPlusLate
##  Min.   : 0.000                      Min.   :0.00                   
##  1st Qu.: 0.000                      1st Qu.:0.00                   
##  Median : 0.000                      Median :0.00                   
##  Mean   : 0.775                      Mean   :0.06                   
##  3rd Qu.: 0.000                      3rd Qu.:0.00                   
##  Max.   :24.000                      Max.   :8.00                   
##  NA's   :10015                       NA's   :10015                  
##  ProsperPrincipalBorrowed ProsperPrincipalOutstanding
##  Min.   : 1000            Min.   :    0.00           
##  1st Qu.: 3000            1st Qu.:    4.33           
##  Median : 5000            Median : 2068.99           
##  Mean   : 6653            Mean   : 3054.05           
##  3rd Qu.: 8250            3rd Qu.: 4161.71           
##  Max.   :45000            Max.   :21862.26           
##  NA's   :10015            NA's   :10015              
##  ScorexChangeAtTimeOfListing LoanCurrentDaysDelinquent
##  Min.   :-194.00             Min.   : 121.0           
##  1st Qu.: -40.00             1st Qu.: 507.8           
##  Median :  -8.00             Median :1389.0           
##  Mean   : -10.56             Mean   :1256.0           
##  3rd Qu.:  18.00             3rd Qu.:1927.0           
##  Max.   : 214.00             Max.   :2704.0           
##  NA's   :10016                                        
##  LoanFirstDefaultedCycleNumber LoanMonthsSinceOrigination   LoanNumber    
##  Min.   : 1.00                 Min.   : 5.00              Min.   :    59  
##  1st Qu.:10.00                 1st Qu.:29.00              1st Qu.: 18375  
##  Median :16.00                 Median :68.00              Median : 34842  
##  Mean   :17.06                 Mean   :55.72              Mean   : 37703  
##  3rd Qu.:23.00                 3rd Qu.:79.00              3rd Qu.: 55294  
##  Max.   :41.00                 Max.   :97.00              Max.   :103467  
##  NA's   :7                                                                
##  LoanOriginalAmount LoanOriginationDate  LoanOriginationQuarter
##  Min.   : 1000      Min.   :2006-02-21   Q2 2008:1093          
##  1st Qu.: 3000      1st Qu.:2007-08-28   Q2 2007: 840          
##  Median : 4500      Median :2008-07-31   Q3 2008: 829          
##  Mean   : 6399      Mean   :2009-07-26   Q3 2007: 732          
##  3rd Qu.: 8000      3rd Qu.:2011-10-31   Q1 2008: 731          
##  Max.   :25000      Max.   :2013-10-01   Q1 2007: 723          
##                                          (Other):7044          
##                    MemberKey     MonthlyLoanPayment LP_CustomerPayments
##  006C3373804016872128132:    2   Min.   :  29.97    Min.   :   -2.35   
##  009C35078002646985845CF:    2   1st Qu.: 112.24    1st Qu.:  789.91   
##  00C43387968070538859D91:    2   Median : 173.71    Median : 1788.68   
##  01DC337523139473583CD4C:    2   Mean   : 235.36    Mean   : 2888.32   
##  01F733654063535141541E2:    2   3rd Qu.: 308.21    3rd Qu.: 3666.66   
##  02493426138410789969A20:    2   Max.   :1552.76    Max.   :29825.73   
##  (Other)                :11980                                         
##  LP_CustomerPrincipalPayments LP_InterestandFees LP_ServiceFees   
##  Min.   :    0.0              Min.   :   -2.35   Min.   :-664.87  
##  1st Qu.:  352.1              1st Qu.:  376.45   1st Qu.: -61.53  
##  Median :  914.4              Median :  793.99   Median : -29.15  
##  Mean   : 1731.2              Mean   : 1157.11   Mean   : -46.86  
##  3rd Qu.: 2093.6              3rd Qu.: 1520.05   3rd Qu.: -12.83  
##  Max.   :24074.3              Max.   :14329.49   Max.   :   0.00  
##                                                                   
##  LP_CollectionFees  LP_GrossPrincipalLoss LP_NetPrincipalLoss
##  Min.   :-9274.75   Min.   :    0         Min.   :  -69.19   
##  1st Qu.:  -18.75   1st Qu.: 1817         1st Qu.: 1765.05   
##  Median :    0.00   Median : 3345         Median : 3301.16   
##  Mean   :  -49.84   Mean   : 4663         Mean   : 4608.30   
##  3rd Qu.:    0.00   3rd Qu.: 5956         3rd Qu.: 5887.32   
##  Max.   :    0.00   Max.   :25000         Max.   :25000.00   
##                                                              
##  LP_NonPrincipalRecoverypayments PercentFunded    Recommendations   
##  Min.   :    0.0                 Min.   :0.7012   Min.   : 0.00000  
##  1st Qu.:    0.0                 1st Qu.:1.0000   1st Qu.: 0.00000  
##  Median :    0.0                 Median :1.0000   Median : 0.00000  
##  Mean   :  136.9                 Mean   :0.9978   Mean   : 0.08547  
##  3rd Qu.:    0.0                 3rd Qu.:1.0000   3rd Qu.: 0.00000  
##  Max.   :21117.9                 Max.   :1.0000   Max.   :16.00000  
##                                                                     
##  InvestmentFromFriendsCount InvestmentFromFriendsAmount   Investors     
##  Min.   :0.00000            Min.   :    0.00            Min.   :  1.00  
##  1st Qu.:0.00000            1st Qu.:    0.00            1st Qu.: 28.00  
##  Median :0.00000            Median :    0.00            Median : 61.00  
##  Mean   :0.03619            Mean   :   32.14            Mean   : 96.11  
##  3rd Qu.:0.00000            3rd Qu.:    0.00            3rd Qu.:127.00  
##  Max.   :9.00000            Max.   :12500.00            Max.   :870.00  
##                                                                         
##   PercentYield     Completed      
##  Min.   :-1.0009   Mode :logical  
##  1st Qu.:-0.7870   FALSE:11992    
##  Median :-0.5762                  
##  Mean   :-0.5095                  
##  3rd Qu.:-0.2864                  
##  Max.   : 1.5901                  
## 
summary(filter(relevant_data, LoanStatus == "Defaulted"))
##                    ListingKey   ListingNumber     ListingCreationDate 
##  00003546482094282EF90E5:   1   Min.   :     99   Min.   :2006-01-25  
##  001C3375545731729D10129:   1   1st Qu.:  69064   1st Qu.:2006-12-01  
##  001D33654297803968707DD:   1   Median : 178389   Median :2007-07-29  
##  00293413955892317967503:   1   Mean   : 232428   Mean   :2008-03-21  
##  005B3378937131619860EC9:   1   3rd Qu.: 367876   3rd Qu.:2008-07-15  
##  00773373220677521177C26:   1   Max.   :1099553   Max.   :2013-12-27  
##  (Other)                :5012                                         
##   CreditGrade        Term                        LoanStatus  
##  HR     : 891   Min.   :12.00   Defaulted             :5018  
##  C      : 729   1st Qu.:36.00   Cancelled             :   0  
##  D      : 684   Median :36.00   Chargedoff            :   0  
##  E      : 665   Mean   :36.84   Completed             :   0  
##  B      : 493   3rd Qu.:36.00   Current               :   0  
##  (Other): 548   Max.   :60.00   FinalPaymentInProgress:   0  
##  NA's   :1008                   (Other)               :   0  
##    ClosedDate          BorrowerAPR       BorrowerRate     LenderYield     
##  Min.   :2006-09-05   Min.   :0.00864   Min.   :0.0000   Min.   :-0.0100  
##  1st Qu.:2007-10-28   1st Qu.:0.17722   1st Qu.:0.1650   1st Qu.: 0.1549  
##  Median :2009-02-12   Median :0.24001   Median :0.2296   Median : 0.2150  
##  Mean   :2009-07-10   Mean   :0.23893   Mean   :0.2231   Mean   : 0.2121  
##  3rd Qu.:2010-08-06   3rd Qu.:0.29776   3rd Qu.:0.2875   3rd Qu.: 0.2700  
##  Max.   :2014-03-04   Max.   :0.50633   Max.   :0.4975   Max.   : 0.4800  
##                                                                           
##  EstimatedEffectiveYield EstimatedLoss   EstimatedReturn 
##  Min.   :-0.046          Min.   :0.006   Min.   :-0.046  
##  1st Qu.: 0.146          1st Qu.:0.085   1st Qu.: 0.109  
##  Median : 0.233          Median :0.112   Median : 0.127  
##  Mean   : 0.209          Mean   :0.112   Mean   : 0.123  
##  3rd Qu.: 0.285          3rd Qu.:0.147   3rd Qu.: 0.144  
##  Max.   : 0.320          Max.   :0.366   Max.   : 0.254  
##  NA's   :4013            NA's   :4013    NA's   :4013    
##  ProsperRating.num ProsperRating.alpha  ProsperScore   ListingCategory.num
##  Min.   :1.000     D      : 282        Min.   : 1.00   0      :2903       
##  1st Qu.:2.000     HR     : 209        1st Qu.: 4.00   1      :1045       
##  Median :3.000     E      : 193        Median : 6.00   3      : 301       
##  Mean   :3.016     C      : 134        Mean   : 5.62   7      : 266       
##  3rd Qu.:4.000     B      :  88        3rd Qu.: 7.00   4      : 198       
##  Max.   :7.000     (Other):  99        Max.   :11.00   2      : 141       
##  NA's   :4013      NA's   :4013        NA's   :4013    (Other): 164       
##  BorrowerState                     Occupation        EmploymentStatus
##  CA     : 732   Other                   :1281   Full-time    :2217   
##  GA     : 343   Professional            : 482   Not available:1204   
##  TX     : 342   Clerical                : 268   Employed     : 630   
##  IL     : 337   Administrative Assistant: 216   Self-employed: 246   
##  FL     : 222   Sales - Commission      : 205   Part-time    :  60   
##  (Other):2453   (Other)                 :2044   (Other)      : 139   
##  NA's   : 589   NA's                    : 522   NA's         : 522   
##  EmploymentStatusDuration IsBorrowerHomeowner CurrentlyInGroup
##  Min.   :  0.00           Mode :logical       Mode :logical   
##  1st Qu.: 19.00           FALSE:2744          FALSE:3155      
##  Median : 52.00           TRUE :2274          TRUE :1863      
##  Mean   : 79.48                                               
##  3rd Qu.:112.50                                               
##  Max.   :554.00                                               
##  NA's   :1727                                                 
##                     GroupKey    DateCreditPulled     CreditScoreRangeLower
##  783C3371218786870A73D20: 208   Min.   :2005-12-11   Min.   :  0.0        
##  6A3B336601725506917317E: 156   1st Qu.:2006-11-27   1st Qu.:560.0        
##  3D4D3366260257624AB272D: 155   Median :2007-07-24   Median :640.0        
##  FE113364863511529673D04:  99   Mean   :2008-03-17   Mean   :620.9        
##  FEF83377364176536637E50:  79   3rd Qu.:2008-07-10   3rd Qu.:680.0        
##  (Other)                :1341   Max.   :2013-12-27   Max.   :860.0        
##  NA's                   :2980                        NA's   :126          
##  CreditScoreRangeUpper FirstRecordedCreditLine CurrentCreditLines
##  Min.   : 19.0         Min.   :1947-08-24      Min.   : 0.00     
##  1st Qu.:579.0         1st Qu.:1990-08-10      1st Qu.: 6.00     
##  Median :659.0         Median :1995-08-04      Median :10.00     
##  Mean   :639.9         Mean   :1994-07-20      Mean   :10.64     
##  3rd Qu.:699.0         3rd Qu.:1999-08-12      3rd Qu.:14.00     
##  Max.   :879.0         Max.   :2009-08-13      Max.   :52.00     
##  NA's   :126           NA's   :157             NA's   :1727      
##  OpenCreditLines  TotalCreditLinespast7years OpenRevolvingAccounts
##  Min.   : 0.000   Min.   :  2.00             Min.   : 0.000       
##  1st Qu.: 5.000   1st Qu.: 16.00             1st Qu.: 2.000       
##  Median : 8.000   Median : 24.00             Median : 5.000       
##  Mean   : 9.168   Mean   : 26.12             Mean   : 5.646       
##  3rd Qu.:12.000   3rd Qu.: 35.00             3rd Qu.: 8.000       
##  Max.   :51.000   Max.   :101.00             Max.   :51.000       
##  NA's   :1727     NA's   :157                                     
##  OpenRevolvingMonthlyPayment InquiriesLast6Months TotalInquiries  
##  Min.   :   0.0              Min.   : 0.000       Min.   :  0.00  
##  1st Qu.:  25.0              1st Qu.: 1.000       1st Qu.:  4.00  
##  Median : 155.0              Median : 2.000       Median :  8.00  
##  Mean   : 344.7              Mean   : 3.538       Mean   : 11.07  
##  3rd Qu.: 446.8              3rd Qu.: 5.000       3rd Qu.: 15.00  
##  Max.   :8001.0              Max.   :53.000       Max.   :158.00  
##                              NA's   :157          NA's   :243     
##  CurrentDelinquencies AmountDelinquent   DelinquenciesLast7Years
##  Min.   : 0.000       Min.   :     0.0   Min.   : 0.000         
##  1st Qu.: 0.000       1st Qu.:     0.0   1st Qu.: 0.000         
##  Median : 0.000       Median :     0.0   Median : 0.000         
##  Mean   : 2.137       Mean   :  1207.4   Mean   : 6.025         
##  3rd Qu.: 2.000       3rd Qu.:    24.5   3rd Qu.: 7.000         
##  Max.   :83.000       Max.   :183396.0   Max.   :99.000         
##  NA's   :157          NA's   :1727       NA's   :223            
##  PublicRecordsLast10Years PublicRecordsLast12Months RevolvingCreditBalance
##  Min.   : 0.0000          Min.   :0.0000            Min.   :     0        
##  1st Qu.: 0.0000          1st Qu.:0.0000            1st Qu.:  2120        
##  Median : 0.0000          Median :0.0000            Median :  8233        
##  Mean   : 0.4203          Mean   :0.0383            Mean   : 20584        
##  3rd Qu.: 1.0000          3rd Qu.:0.0000            3rd Qu.: 21938        
##  Max.   :22.0000          Max.   :4.0000            Max.   :486503        
##  NA's   :157              NA's   :1727              NA's   :1727          
##  BankcardUtilization AvailableBankcardCredit  TotalTrades  
##  Min.   :0.0000      Min.   :     0          Min.   : 1.0  
##  1st Qu.:0.3400      1st Qu.:   302          1st Qu.:14.0  
##  Median :0.6900      Median :  2436          Median :22.0  
##  Mean   :0.6133      Mean   :  7889          Mean   :23.6  
##  3rd Qu.:0.9000      3rd Qu.:  9018          3rd Qu.:31.0  
##  Max.   :4.7300      Max.   :289168          Max.   :86.0  
##  NA's   :1727        NA's   :1724            NA's   :1724  
##  TradesNeverDelinquent.per TradesOpenedLast6Months DebtToIncomeRatio
##  Min.   :0.0000            Min.   : 0.000          Min.   : 0.0000  
##  1st Qu.:0.7300            1st Qu.: 0.000          1st Qu.: 0.1400  
##  Median :0.9000            Median : 1.000          Median : 0.2200  
##  Mean   :0.8327            Mean   : 1.256          Mean   : 0.3693  
##  3rd Qu.:1.0000            3rd Qu.: 2.000          3rd Qu.: 0.3500  
##  Max.   :1.0000            Max.   :11.000          Max.   :10.0100  
##  NA's   :1724              NA's   :1724            NA's   :255      
##          IncomeRange   IncomeVerifiable StatedMonthlyIncome
##  Not displayed :1747   Mode :logical    Min.   :    0      
##  $25,000-49,999:1290   FALSE:258        1st Qu.: 2500      
##  $50,000-74,999: 874   TRUE :4760       Median : 3708      
##  $75,000-99,999: 375                    Mean   : 4367      
##  $1-24,999     : 334                    3rd Qu.: 5417      
##  $100,000+     : 322                    Max.   :58617      
##  (Other)       :  76                                       
##                     LoanKey     TotalProsperLoans
##  000B3366346245964D6187E:   1   Min.   :1.000    
##  00193564075967640E1A9A1:   1   1st Qu.:1.000    
##  00483379319461501511D07:   1   Median :1.000    
##  004C3382466805517A0159B:   1   Mean   :1.256    
##  004E33659258749958952AB:   1   3rd Qu.:1.000    
##  004E3381838563573A42B75:   1   Max.   :6.000    
##  (Other)                :5012   NA's   :4541     
##  TotalProsperPaymentsBilled OnTimeProsperPayments
##  Min.   :  0.00             Min.   :  0.00       
##  1st Qu.:  7.00             1st Qu.:  7.00       
##  Median : 12.00             Median : 12.00       
##  Mean   : 17.89             Mean   : 17.07       
##  3rd Qu.: 23.00             3rd Qu.: 21.00       
##  Max.   :101.00             Max.   :101.00       
##  NA's   :4541               NA's   :4541         
##  ProsperPaymentsLessThanOneMonthLate ProsperPaymentsOneMonthPlusLate
##  Min.   : 0.000                      Min.   :0.000                  
##  1st Qu.: 0.000                      1st Qu.:0.000                  
##  Median : 0.000                      Median :0.000                  
##  Mean   : 0.692                      Mean   :0.124                  
##  3rd Qu.: 0.000                      3rd Qu.:0.000                  
##  Max.   :26.000                      Max.   :9.000                  
##  NA's   :4541                        NA's   :4541                   
##  ProsperPrincipalBorrowed ProsperPrincipalOutstanding
##  Min.   : 1000            Min.   :    0              
##  1st Qu.: 3000            1st Qu.:    0              
##  Median : 5000            Median : 1717              
##  Mean   : 6709            Mean   : 2933              
##  3rd Qu.: 8000            3rd Qu.: 4147              
##  Max.   :53200            Max.   :22587              
##  NA's   :4541             NA's   :4541               
##  ScorexChangeAtTimeOfListing LoanCurrentDaysDelinquent
##  Min.   :-160.000            Min.   :   1.0           
##  1st Qu.: -40.000            1st Qu.: 175.0           
##  Median :   0.000            Median : 249.0           
##  Mean   :  -6.937            Mean   : 451.1           
##  3rd Qu.:  21.500            3rd Qu.: 551.8           
##  Max.   : 176.000            Max.   :2421.0           
##  NA's   :4543                                         
##  LoanFirstDefaultedCycleNumber LoanMonthsSinceOrigination   LoanNumber    
##  Min.   : 0.0                  Min.   : 3.00              Min.   :    29  
##  1st Qu.: 8.0                  1st Qu.:68.00              1st Qu.:  5451  
##  Median :12.0                  Median :79.00              Median : 17895  
##  Mean   :14.3                  Mean   :71.47              Mean   : 23119  
##  3rd Qu.:19.0                  3rd Qu.:87.00              3rd Qu.: 34604  
##  Max.   :44.0                  Max.   :98.00              Max.   :124070  
##  NA's   :105                                                              
##  LoanOriginalAmount LoanOriginationDate  LoanOriginationQuarter
##  Min.   : 1000      Min.   :2006-01-27   Q1 2007: 568          
##  1st Qu.: 2550      1st Qu.:2006-12-13   Q4 2006: 557          
##  Median : 4275      Median :2007-08-10   Q3 2006: 474          
##  Mean   : 6487      Mean   :2008-04-02   Q2 2007: 425          
##  3rd Qu.: 8000      3rd Qu.:2008-07-25   Q2 2008: 384          
##  Max.   :25000      Max.   :2013-12-31   Q3 2008: 318          
##                                          (Other):2292          
##                    MemberKey    MonthlyLoanPayment LP_CustomerPayments
##  018B35275926204010E51B6:   2   Min.   :   0.00    Min.   :    0.0    
##  01D33386346150055C7F757:   2   1st Qu.:  99.92    1st Qu.:  452.4    
##  01DA3382241797159B9FE89:   2   Median : 167.62    Median : 1297.3    
##  03863429108114327FA5713:   2   Mean   : 233.49    Mean   : 2607.5    
##  03F43394048903402B10A91:   2   3rd Qu.: 303.10    3rd Qu.: 3300.1    
##  06AC3396777271901E3E43F:   2   Max.   :1102.78    Max.   :34021.8    
##  (Other)                :5006                                         
##  LP_CustomerPrincipalPayments LP_InterestandFees LP_ServiceFees    
##  Min.   :    0.0              Min.   :    0.0    Min.   :-425.880  
##  1st Qu.:  206.1              1st Qu.:  217.5    1st Qu.: -52.258  
##  Median :  680.9              Median :  568.6    Median : -19.980  
##  Mean   : 1648.5              Mean   :  959.0    Mean   : -39.681  
##  3rd Qu.: 1930.1              3rd Qu.: 1252.5    3rd Qu.:  -5.965  
##  Max.   :24939.2              Max.   :12242.0    Max.   :  32.060  
##                                                                    
##  LP_CollectionFees LP_GrossPrincipalLoss LP_NetPrincipalLoss
##  Min.   :-6221.3   Min.   :  -94.2       Min.   : -954.5    
##  1st Qu.:  -16.5   1st Qu.: 1759.3       1st Qu.: 1364.7    
##  Median :    0.0   Median : 3151.0       Median : 2894.3    
##  Mean   : -116.6   Mean   : 4761.6       Mean   : 4459.2    
##  3rd Qu.:    0.0   3rd Qu.: 5998.3       3rd Qu.: 5645.4    
##  Max.   :    0.0   Max.   :25000.0       Max.   :25000.0    
##                                                             
##  LP_NonPrincipalRecoverypayments PercentFunded    Recommendations  
##  Min.   :    0.0                 Min.   :0.7063   Min.   :0.00000  
##  1st Qu.:    0.0                 1st Qu.:1.0000   1st Qu.:0.00000  
##  Median :    0.0                 Median :1.0000   Median :0.00000  
##  Mean   :  243.7                 Mean   :0.9993   Mean   :0.07553  
##  3rd Qu.:    0.0                 3rd Qu.:1.0000   3rd Qu.:0.00000  
##  Max.   :13605.4                 Max.   :1.0000   Max.   :7.00000  
##                                                                    
##  InvestmentFromFriendsCount InvestmentFromFriendsAmount   Investors    
##  Min.   :0.0000             Min.   :   0.00             Min.   :  1.0  
##  1st Qu.:0.0000             1st Qu.:   0.00             1st Qu.: 26.0  
##  Median :0.0000             Median :   0.00             Median : 57.0  
##  Mean   :0.0283             Mean   :  16.61             Mean   :100.7  
##  3rd Qu.:0.0000             3rd Qu.:   0.00             3rd Qu.:132.0  
##  Max.   :7.0000             Max.   :8200.00             Max.   :881.0  
##                                                                        
##   PercentYield     Completed      
##  Min.   :-1.0000   Mode :logical  
##  1st Qu.:-0.8547   FALSE:5018     
##  Median :-0.6678                  
##  Mean   :-0.5176                  
##  3rd Qu.:-0.2665                  
##  Max.   : 1.7298                  
## 

Predicted Profit

Now, before using them as predictors of actual profit, I want to briefly look at the relationship between measures of predicted profit: predicted loss, and predicted yield. In this case, EstimatedEffectiveYield vs. EstimatedLoss seems to be the most informative comparison - how much one stands to gain, total, vs. how much one stands to lose.

EstimatedEffectiveYield vs. EstimatedLoss

ggplot(data, aes(x = EstimatedEffectiveYield, y = EstimatedLoss)) + 
  geom_hex(bins = 50) + 
  geom_smooth(formula = y~x)

This plot shows that, as one would expect, when the estimated effective yield (taking into account fees or possibly lost interest on charge-offs) is around or below zero, the estimated loss rises. It also interestingly shows that as the estimated effective yield rises, so does the estimated loss. Presumably based on historical data, then, Prosper predicts that the more one stands to gain, the more one stands to lose. Lowest estimated risk of loss seems to be around 5% effective yield.

EstimatedLoss vs. LenderYield

ggplot(data, aes(x = EstimatedLoss, y = LenderYield)) + 
  geom_hex(bins = 50) + 
  geom_smooth(formula = y~x)

This graph shows clearly that while predicted lender yield increases with predicted loss, at higher levels of loss, the yield ceases to increase, and levels off, or even drops slightly towards higher levels of predicted loss. This is presumably due to the fact that those loans which lenders profit the most from tend to be higher-interest loans - with those customers who are charged higher interest rates typically being those most likely to fail to repay their loan.

What one would expect, then, in terms of actual profit, is that as borrower demographics ‘worsen’ and interet rates rise, lenders are both more likely to lose, and to stand a chance of earning more.

Scores vs. Actual Profit

Rating

LoanStatus

First, only 5 loans in the relevant data set were ever cancelled, so I will exclude this category from analysis.

relevant_data %>%
  filter(LoanStatus=="Cancelled")
## # A tibble: 5 x 82
##   ListingKey   ListingNumber ListingCreation… CreditGrade  Term LoanStatus
##   <fct>                <int> <date>           <ord>       <int> <fct>     
## 1 093D3365430…          8912 2006-04-26       HR             36 Cancelled 
## 2 F77F3366454…          7646 2006-04-25       C              36 Cancelled 
## 3 09E13366275…           908 2006-02-28       HR             36 Cancelled 
## 4 29453366240…         11364 2006-05-05       HR             36 Cancelled 
## 5 E77E3433009…        410006 2008-10-07       A              36 Cancelled 
## # ... with 76 more variables: ClosedDate <date>, BorrowerAPR <dbl>,
## #   BorrowerRate <dbl>, LenderYield <dbl>, EstimatedEffectiveYield <dbl>,
## #   EstimatedLoss <dbl>, EstimatedReturn <dbl>, ProsperRating.num <int>,
## #   ProsperRating.alpha <ord>, ProsperScore <dbl>,
## #   ListingCategory.num <fct>, BorrowerState <fct>, Occupation <fct>,
## #   EmploymentStatus <fct>, EmploymentStatusDuration <int>,
## #   IsBorrowerHomeowner <lgl>, CurrentlyInGroup <lgl>, GroupKey <fct>,
## #   DateCreditPulled <date>, CreditScoreRangeLower <int>,
## #   CreditScoreRangeUpper <int>, FirstRecordedCreditLine <date>,
## #   CurrentCreditLines <int>, OpenCreditLines <int>,
## #   TotalCreditLinespast7years <int>, OpenRevolvingAccounts <int>,
## #   OpenRevolvingMonthlyPayment <dbl>, InquiriesLast6Months <int>,
## #   TotalInquiries <dbl>, CurrentDelinquencies <int>,
## #   AmountDelinquent <dbl>, DelinquenciesLast7Years <int>,
## #   PublicRecordsLast10Years <int>, PublicRecordsLast12Months <int>,
## #   RevolvingCreditBalance <dbl>, BankcardUtilization <dbl>,
## #   AvailableBankcardCredit <dbl>, TotalTrades <dbl>,
## #   TradesNeverDelinquent.per <dbl>, TradesOpenedLast6Months <dbl>,
## #   DebtToIncomeRatio <dbl>, IncomeRange <ord>, IncomeVerifiable <lgl>,
## #   StatedMonthlyIncome <dbl>, LoanKey <fct>, TotalProsperLoans <int>,
## #   TotalProsperPaymentsBilled <int>, OnTimeProsperPayments <int>,
## #   ProsperPaymentsLessThanOneMonthLate <int>,
## #   ProsperPaymentsOneMonthPlusLate <int>, ProsperPrincipalBorrowed <dbl>,
## #   ProsperPrincipalOutstanding <dbl>, ScorexChangeAtTimeOfListing <int>,
## #   LoanCurrentDaysDelinquent <int>, LoanFirstDefaultedCycleNumber <int>,
## #   LoanMonthsSinceOrigination <int>, LoanNumber <int>,
## #   LoanOriginalAmount <int>, LoanOriginationDate <date>,
## #   LoanOriginationQuarter <ord>, MemberKey <fct>,
## #   MonthlyLoanPayment <dbl>, LP_CustomerPayments <dbl>,
## #   LP_CustomerPrincipalPayments <dbl>, LP_InterestandFees <dbl>,
## #   LP_ServiceFees <dbl>, LP_CollectionFees <dbl>,
## #   LP_GrossPrincipalLoss <dbl>, LP_NetPrincipalLoss <dbl>,
## #   LP_NonPrincipalRecoverypayments <dbl>, PercentFunded <dbl>,
## #   Recommendations <int>, InvestmentFromFriendsCount <int>,
## #   InvestmentFromFriendsAmount <dbl>, Investors <int>, PercentYield <dbl>
relevant_data <- relevant_data %>%
  filter(LoanStatus!="Cancelled")
plot_data <- relevant_data %>% 
  mutate(Rating = coalesce(CreditGrade, ProsperRating.alpha)) %>%
  mutate(LoanStatus = ordered(LoanStatus, c("Completed","Chargedoff","Defaulted"))) %>%
  group_by(Rating, LoanStatus) %>% 
  tally %>% 
  mutate(percent = n/sum(n))

ggplot(plot_data, aes(x = LoanStatus, y=percent)) +
  geom_bar(stat = "identity") +
  labs(title = "Loan Status by Rating") +
  facet_wrap(~Rating) +
  theme(axis.text.x = element_text(angle = 90, hjust = 1))

PercentYield
plot_data <- relevant_data %>% 
  mutate(Rating = coalesce(CreditGrade, ProsperRating.alpha)) %>%
  group_by(Rating) %>%
  summarize(PercentYield = mean(PercentYield))

ggplot(plot_data, aes(x = Rating, y=PercentYield)) +
  geom_bar(stat = "identity")
relevant_data %>% 
  mutate(Rating = coalesce(CreditGrade, ProsperRating.alpha)) %>%
  filter(is.na(Rating)) %>%
  summary()
##                    ListingKey  ListingNumber    ListingCreationDate 
##  0385345033494662260733C:  1   Min.   :149172   Min.   :2007-06-08  
##  04D73431953660481B1EC1D:  1   1st Qu.:306608   1st Qu.:2008-04-08  
##  04F334232790941784498F1:  1   Median :339464   Median :2008-05-26  
##  05153419481232978723A5F:  1   Mean   :341138   Mean   :2008-06-24  
##  059934165217732065237C5:  1   3rd Qu.:397924   3rd Qu.:2008-09-13  
##  06FF342963152332574DF05:  1   Max.   :415961   Max.   :2009-05-06  
##  (Other)                :125                                        
##   CreditGrade       Term                        LoanStatus 
##  NC     :  0   Min.   :12.00   Completed             :122  
##  HR     :  0   1st Qu.:36.00   Chargedoff            :  6  
##  E      :  0   Median :36.00   Defaulted             :  3  
##  D      :  0   Mean   :35.82   Cancelled             :  0  
##  C      :  0   3rd Qu.:36.00   Current               :  0  
##  (Other):  0   Max.   :36.00   FinalPaymentInProgress:  0  
##  NA's   :131                   (Other)               :  0  
##    ClosedDate          BorrowerAPR       BorrowerRate    
##  Min.   :2010-01-28   Min.   :0.06207   Min.   :0.05870  
##  1st Qu.:2011-04-21   1st Qu.:0.11271   1st Qu.:0.09025  
##  Median :2012-04-05   Median :0.17018   Median :0.14000  
##  Mean   :2012-02-01   Mean   :0.18688   Mean   :0.16300  
##  3rd Qu.:2012-10-29   3rd Qu.:0.25811   3rd Qu.:0.22700  
##  Max.   :2013-10-12   Max.   :0.39460   Max.   :0.35300  
##                                                          
##   LenderYield      EstimatedEffectiveYield EstimatedLoss EstimatedReturn
##  Min.   :0.04870   Min.   : NA             Min.   : NA   Min.   : NA    
##  1st Qu.:0.08025   1st Qu.: NA             1st Qu.: NA   1st Qu.: NA    
##  Median :0.13000   Median : NA             Median : NA   Median : NA    
##  Mean   :0.15293   Mean   :NaN             Mean   :NaN   Mean   :NaN    
##  3rd Qu.:0.21700   3rd Qu.: NA             3rd Qu.: NA   3rd Qu.: NA    
##  Max.   :0.34000   Max.   : NA             Max.   : NA   Max.   : NA    
##                    NA's   :131             NA's   :131   NA's   :131    
##  ProsperRating.num ProsperRating.alpha  ProsperScore ListingCategory.num
##  Min.   : NA       NC     :  0         Min.   : NA   1      :66         
##  1st Qu.: NA       HR     :  0         1st Qu.: NA   7      :24         
##  Median : NA       E      :  0         Median : NA   3      :17         
##  Mean   :NaN       D      :  0         Mean   :NaN   2      :11         
##  3rd Qu.: NA       C      :  0         3rd Qu.: NA   6      : 7         
##  Max.   : NA       (Other):  0         Max.   : NA   5      : 6         
##  NA's   :131       NA's   :131         NA's   :131   (Other): 0         
##  BorrowerState                    Occupation      EmploymentStatus
##  CA     :18    Other                   :30   Full-time    :104    
##  TX     :18    Professional            :23   Employed     : 12    
##  NY     : 9    Analyst                 : 9   Part-time    :  7    
##  IL     : 7    Computer Programmer     : 9   Retired      :  4    
##  CT     : 6    Administrative Assistant: 5   Self-employed:  4    
##  MN     : 6    Teacher                 : 5   Not available:  0    
##  (Other):67    (Other)                 :50   (Other)      :  0    
##  EmploymentStatusDuration IsBorrowerHomeowner CurrentlyInGroup
##  Min.   :  0.00           Mode :logical       Mode :logical   
##  1st Qu.: 26.00           FALSE:66            FALSE:107       
##  Median : 50.00           TRUE :65            TRUE :24        
##  Mean   : 74.24                                               
##  3rd Qu.:105.00                                               
##  Max.   :472.00                                               
##                                                               
##                     GroupKey   DateCreditPulled     CreditScoreRangeLower
##  783C3371218786870A73D20:  5   Min.   :2009-07-13   Min.   :600.0        
##  020E3366126106360DB9421:  1   1st Qu.:2009-10-19   1st Qu.:660.0        
##  17693364417023401A53169:  1   Median :2010-02-03   Median :720.0        
##  18DA336463918236939DCE7:  1   Mean   :2010-02-23   Mean   :711.1        
##  3D4D3366260257624AB272D:  1   3rd Qu.:2010-07-02   3rd Qu.:740.0        
##  (Other)                : 15   Max.   :2010-12-19   Max.   :860.0        
##  NA's                   :107                                             
##  CreditScoreRangeUpper FirstRecordedCreditLine CurrentCreditLines
##  Min.   :619.0         Min.   :1959-10-01      Min.   : 1.00     
##  1st Qu.:679.0         1st Qu.:1992-12-11      1st Qu.: 7.00     
##  Median :739.0         Median :1996-08-28      Median : 9.00     
##  Mean   :730.1         Mean   :1995-06-17      Mean   :10.27     
##  3rd Qu.:759.0         3rd Qu.:2000-04-07      3rd Qu.:13.00     
##  Max.   :879.0         Max.   :2007-09-10      Max.   :35.00     
##                                                                  
##  OpenCreditLines  TotalCreditLinespast7years OpenRevolvingAccounts
##  Min.   : 1.000   Min.   : 4.00              Min.   : 0.000       
##  1st Qu.: 5.000   1st Qu.:17.00              1st Qu.: 4.000       
##  Median : 8.000   Median :22.00              Median : 6.000       
##  Mean   : 8.832   Mean   :25.51              Mean   : 6.855       
##  3rd Qu.:12.000   3rd Qu.:33.00              3rd Qu.: 9.000       
##  Max.   :29.000   Max.   :58.00              Max.   :29.000       
##                                                                   
##  OpenRevolvingMonthlyPayment InquiriesLast6Months TotalInquiries  
##  Min.   :   0.0              Min.   :0.000        Min.   : 0.000  
##  1st Qu.:  90.5              1st Qu.:0.000        1st Qu.: 2.000  
##  Median : 239.0              Median :0.000        Median : 4.000  
##  Mean   : 309.1              Mean   :0.855        Mean   : 5.191  
##  3rd Qu.: 420.0              3rd Qu.:1.000        3rd Qu.: 8.000  
##  Max.   :1956.0              Max.   :9.000        Max.   :19.000  
##                                                                   
##  CurrentDelinquencies AmountDelinquent  DelinquenciesLast7Years
##  Min.   :0.0000       Min.   :    0.0   Min.   : 0.000         
##  1st Qu.:0.0000       1st Qu.:    0.0   1st Qu.: 0.000         
##  Median :0.0000       Median :    0.0   Median : 0.000         
##  Mean   :0.2824       Mean   :  433.7   Mean   : 2.718         
##  3rd Qu.:0.0000       3rd Qu.:    0.0   3rd Qu.: 0.000         
##  Max.   :8.0000       Max.   :31919.0   Max.   :43.000         
##                                                                
##  PublicRecordsLast10Years PublicRecordsLast12Months RevolvingCreditBalance
##  Min.   :0.0000           Min.   :0                 Min.   :    0         
##  1st Qu.:0.0000           1st Qu.:0                 1st Qu.: 2308         
##  Median :0.0000           Median :0                 Median : 8074         
##  Mean   :0.1756           Mean   :0                 Mean   :12039         
##  3rd Qu.:0.0000           3rd Qu.:0                 3rd Qu.:16422         
##  Max.   :3.0000           Max.   :0                 Max.   :97290         
##                                                                           
##  BankcardUtilization AvailableBankcardCredit  TotalTrades   
##  Min.   :0.0000      Min.   :     0          Min.   : 3.00  
##  1st Qu.:0.1800      1st Qu.:  1557          1st Qu.:14.50  
##  Median :0.4400      Median :  6999          Median :19.00  
##  Mean   :0.4524      Mean   : 13522          Mean   :22.21  
##  3rd Qu.:0.7200      3rd Qu.: 17470          3rd Qu.:29.00  
##  Max.   :0.9900      Max.   :110117          Max.   :52.00  
##                                                             
##  TradesNeverDelinquent.per TradesOpenedLast6Months DebtToIncomeRatio
##  Min.   :0.3000            Min.   :0.0000          Min.   :0.0200   
##  1st Qu.:0.8400            1st Qu.:0.0000          1st Qu.:0.1100   
##  Median :0.9600            Median :0.0000          Median :0.2000   
##  Mean   :0.8996            Mean   :0.5725          Mean   :0.2500   
##  3rd Qu.:1.0000            3rd Qu.:1.0000          3rd Qu.:0.2725   
##  Max.   :1.0000            Max.   :5.0000          Max.   :5.5900   
##                                                    NA's   :11       
##          IncomeRange IncomeVerifiable StatedMonthlyIncome
##  $50,000-74,999:45   Mode :logical    Min.   :  212.8    
##  $25,000-49,999:40   FALSE:11         1st Qu.: 3333.3    
##  $75,000-99,999:17   TRUE :120        Median : 4616.7    
##  $100,000+     :16                    Mean   : 5111.2    
##  $1-24,999     :13                    3rd Qu.: 6375.0    
##  Not displayed : 0                    Max.   :20833.3    
##  (Other)       : 0                                       
##                     LoanKey    TotalProsperLoans
##  003C35735230494626ADB02:  1   Min.   :1.000    
##  02CA35638190585257E0D22:  1   1st Qu.:1.000    
##  030B35936026115966F4EA0:  1   Median :1.000    
##  032A357638786716375DFFB:  1   Mean   :1.153    
##  040235782802629332A0C8C:  1   3rd Qu.:1.000    
##  05BC35722810324548A02FE:  1   Max.   :3.000    
##  (Other)                :125   NA's   :72       
##  TotalProsperPaymentsBilled OnTimeProsperPayments
##  Min.   : 1.00              Min.   : 0.00        
##  1st Qu.:14.50              1st Qu.:14.50        
##  Median :24.00              Median :22.00        
##  Mean   :22.76              Mean   :22.54        
##  3rd Qu.:34.00              3rd Qu.:33.50        
##  Max.   :42.00              Max.   :41.00        
##  NA's   :72                 NA's   :72           
##  ProsperPaymentsLessThanOneMonthLate ProsperPaymentsOneMonthPlusLate
##  Min.   :0.0000                      Min.   :0                      
##  1st Qu.:0.0000                      1st Qu.:0                      
##  Median :0.0000                      Median :0                      
##  Mean   :0.2203                      Mean   :0                      
##  3rd Qu.:0.0000                      3rd Qu.:0                      
##  Max.   :3.0000                      Max.   :0                      
##  NA's   :72                          NA's   :72                     
##  ProsperPrincipalBorrowed ProsperPrincipalOutstanding
##  Min.   : 1000            Min.   :   0.00            
##  1st Qu.: 1775            1st Qu.:   0.00            
##  Median : 4500            Median :   0.00            
##  Mean   : 5491            Mean   : 428.24            
##  3rd Qu.: 7500            3rd Qu.:   0.25            
##  Max.   :27000            Max.   :5788.52            
##  NA's   :72               NA's   :72                 
##  ScorexChangeAtTimeOfListing LoanCurrentDaysDelinquent
##  Min.   :-50.00              Min.   :   0.00          
##  1st Qu.: -7.00              1st Qu.:   0.00          
##  Median : 39.00              Median :   0.00          
##  Mean   : 43.37              Mean   :  53.65          
##  3rd Qu.: 83.00              3rd Qu.:   0.00          
##  Max.   :215.00              Max.   :1257.00          
##  NA's   :74                                           
##  LoanFirstDefaultedCycleNumber LoanMonthsSinceOrigination   LoanNumber   
##  Min.   :10.00                 Min.   :39.00              Min.   :38046  
##  1st Qu.:18.00                 1st Qu.:44.00              1st Qu.:39344  
##  Median :23.00                 Median :49.00              Median :40869  
##  Mean   :24.22                 Mean   :48.34              Mean   :41386  
##  3rd Qu.:32.00                 3rd Qu.:52.00              3rd Qu.:43474  
##  Max.   :37.00                 Max.   :56.00              Max.   :46378  
##  NA's   :122                                                             
##  LoanOriginalAmount LoanOriginationDate  LoanOriginationQuarter
##  Min.   : 1000      Min.   :2009-07-22   Q4 2009:32            
##  1st Qu.: 2000      1st Qu.:2009-11-08   Q3 2009:26            
##  Median : 3000      Median :2010-02-17   Q2 2010:21            
##  Mean   : 4187      Mean   :2010-03-11   Q4 2010:21            
##  3rd Qu.: 5000      3rd Qu.:2010-07-18   Q1 2010:17            
##  Max.   :15000      Max.   :2010-12-30   Q3 2010:14            
##                                          (Other): 0            
##                    MemberKey   MonthlyLoanPayment LP_CustomerPayments
##  010B33941340101099BFE47:  1   Min.   :  0.00     Min.   :  458.2    
##  016533808792025682035EE:  1   1st Qu.: 63.24     1st Qu.: 2161.4    
##  0CCD3420393708396FB7287:  1   Median :111.95     Median : 3865.5    
##  0F1733815422230679CFC01:  1   Mean   :146.00     Mean   : 4865.0    
##  0F5133834635103374519DF:  1   3rd Qu.:188.66     3rd Qu.: 6402.7    
##  10D73380714543112C251DF:  1   Max.   :578.69     Max.   :18748.2    
##  (Other)                :125                                         
##  LP_CustomerPrincipalPayments LP_InterestandFees LP_ServiceFees   
##  Min.   :  204.8              Min.   :  11.26    Min.   :-242.93  
##  1st Qu.: 1946.1              1st Qu.: 254.88    1st Qu.: -62.53  
##  Median : 3000.0              Median : 546.00    Median : -38.67  
##  Mean   : 4043.8              Mean   : 821.17    Mean   : -50.11  
##  3rd Qu.: 5000.0              3rd Qu.:1143.52    3rd Qu.: -19.86  
##  Max.   :15000.0              Max.   :3748.19    Max.   :  -1.41  
##                                                                   
##  LP_CollectionFees LP_GrossPrincipalLoss LP_NetPrincipalLoss
##  Min.   :0         Min.   :   0.0        Min.   :   0.0     
##  1st Qu.:0         1st Qu.:   0.0        1st Qu.:   0.0     
##  Median :0         Median :   0.0        Median :   0.0     
##  Mean   :0         Mean   : 145.4        Mean   : 145.4     
##  3rd Qu.:0         3rd Qu.:   0.0        3rd Qu.:   0.0     
##  Max.   :0         Max.   :8911.2        Max.   :8911.2     
##                                                             
##  LP_NonPrincipalRecoverypayments PercentFunded Recommendations  
##  Min.   :0                       Min.   :1     Min.   :0.00000  
##  1st Qu.:0                       1st Qu.:1     1st Qu.:0.00000  
##  Median :0                       Median :1     Median :0.00000  
##  Mean   :0                       Mean   :1     Mean   :0.08397  
##  3rd Qu.:0                       3rd Qu.:1     3rd Qu.:0.00000  
##  Max.   :0                       Max.   :1     Max.   :2.00000  
##                                                                 
##  InvestmentFromFriendsCount InvestmentFromFriendsAmount   Investors    
##  Min.   :0.00000            Min.   :   0.00             Min.   : 10.0  
##  1st Qu.:0.00000            1st Qu.:   0.00             1st Qu.: 75.5  
##  Median :0.00000            Median :   0.00             Median :124.0  
##  Mean   :0.03817            Mean   :  57.97             Mean   :155.5  
##  3rd Qu.:0.00000            3rd Qu.:   0.00             3rd Qu.:204.0  
##  Max.   :1.00000            Max.   :5140.00             Max.   :594.0  
##                                                                        
##   PercentYield          Rating   
##  Min.   :-0.84307   NC     :  0  
##  1st Qu.: 0.06746   HR     :  0  
##  Median : 0.14544   E      :  0  
##  Mean   : 0.15497   D      :  0  
##  3rd Qu.: 0.26568   C      :  0  
##  Max.   : 0.59659   (Other):  0  
##                     NA's   :131
relevant_data %>% 
  mutate(Rating = coalesce(CreditGrade, ProsperRating.alpha)) %>%
  filter(!is.na(Rating)) %>%
  summary()
##                    ListingKey    ListingNumber     ListingCreationDate 
##  018A360063948152589C8BE:    2   Min.   :      4   Min.   :2005-11-09  
##  30F435938764424435A1188:    2   1st Qu.: 185813   1st Qu.:2007-08-12  
##  32943590099161153292459:    2   Median : 386826   Median :2008-08-22  
##  6DFC3591891372387BB41B2:    2   Mean   : 369148   Mean   :2009-07-15  
##  778D35919242972923313E0:    2   3rd Qu.: 524486   3rd Qu.:2011-08-29  
##  82FD35914405776692938D4:    2   Max.   :1204824   Max.   :2014-02-13  
##  (Other)                :54941                                         
##   CreditGrade         Term                        LoanStatus   
##  C      : 5648   Min.   :12.00   Completed             :37952  
##  D      : 5153   1st Qu.:36.00   Chargedoff            :11986  
##  B      : 4389   Median :36.00   Defaulted             : 5015  
##  AA     : 3509   Mean   :36.95   Cancelled             :    0  
##  HR     : 3505   3rd Qu.:36.00   Current               :    0  
##  (Other): 6744   Max.   :60.00   FinalPaymentInProgress:    0  
##  NA's   :26005                   (Other)               :    0  
##    ClosedDate          BorrowerAPR       BorrowerRate     LenderYield     
##  Min.   :2005-11-25   Min.   :0.00653   Min.   :0.0000   Min.   :-0.0100  
##  1st Qu.:2009-07-13   1st Qu.:0.14974   1st Qu.:0.1350   1st Qu.: 0.1250  
##  Median :2011-04-04   Median :0.21434   Median :0.1945   Median : 0.1826  
##  Mean   :2011-03-07   Mean   :0.22228   Mean   :0.2005   Mean   : 0.1904  
##  3rd Qu.:2013-01-30   3rd Qu.:0.29510   3rd Qu.:0.2699   3rd Qu.: 0.2575  
##  Max.   :2014-03-10   Max.   :0.51229   Max.   :0.4975   Max.   : 0.4925  
##                       NA's   :25                                          
##  EstimatedEffectiveYield EstimatedLoss   EstimatedReturn 
##  Min.   :-0.183          Min.   :0.005   Min.   :-0.183  
##  1st Qu.: 0.111          1st Qu.:0.052   1st Qu.: 0.078  
##  Median : 0.172          Median :0.098   Median : 0.114  
##  Mean   : 0.176          Mean   :0.094   Mean   : 0.108  
##  3rd Qu.: 0.247          3rd Qu.:0.140   3rd Qu.: 0.136  
##  Max.   : 0.320          Max.   :0.366   Max.   : 0.284  
##  NA's   :28948           NA's   :28948   NA's   :28948   
##  ProsperRating.num ProsperRating.alpha  ProsperScore   
##  Min.   :1.000     D      : 5869       Min.   : 1.000  
##  1st Qu.:2.000     E      : 3830       1st Qu.: 5.000  
##  Median :3.000     C      : 3817       Median : 6.000  
##  Mean   :3.663     HR     : 3725       Mean   : 6.266  
##  3rd Qu.:5.000     A      : 3608       3rd Qu.: 8.000  
##  Max.   :7.000     (Other): 5156       Max.   :11.000  
##  NA's   :28948     NA's   :28948       NA's   :28948   
##  ListingCategory.num BorrowerState                      Occupation   
##  1      :17801       CA     : 7245   Other                   :14025  
##  0      :16948       FL     : 3073   Professional            : 6492  
##  7      : 6018       IL     : 3032   Computer Programmer     : 2485  
##  3      : 4240       GA     : 2778   Administrative Assistant: 1928  
##  2      : 3233       TX     : 2734   Sales - Commission      : 1809  
##  4      : 2395       (Other):30579   (Other)                 :25949  
##  (Other): 4318       NA's   : 5512   NA's                    : 2265  
##       EmploymentStatus EmploymentStatusDuration IsBorrowerHomeowner
##  Full-time    :24853   Min.   :  0.00           Mode :logical      
##  Employed     :16479   1st Qu.: 21.00           FALSE:29133        
##  Not available: 5346   Median : 52.00           TRUE :25820        
##  Self-employed: 2922   Mean   : 80.91                              
##  Part-time    : 1049   3rd Qu.:112.00                              
##  (Other)      : 2052   Max.   :755.00                              
##  NA's         : 2252   NA's   :7611                                
##  CurrentlyInGroup                    GroupKey     DateCreditPulled    
##  Mode :logical    783C3371218786870A73D20: 1056   Min.   :2005-11-09  
##  FALSE:43133      3D4D3366260257624AB272D:  805   1st Qu.:2007-08-04  
##  TRUE :11820      6A3B336601725506917317E:  672   Median :2008-08-19  
##                   FEF83377364176536637E50:  581   Mean   :2009-07-13  
##                   C9643379247860156A00EC0:  342   3rd Qu.:2011-08-30  
##                   (Other)                : 9188   Max.   :2014-02-13  
##                   NA's                   :42309                       
##  CreditScoreRangeLower CreditScoreRangeUpper FirstRecordedCreditLine
##  Min.   :  0.0         Min.   : 19.0         Min.   :1947-08-24     
##  1st Qu.:640.0         1st Qu.:659.0         1st Qu.:1990-09-26     
##  Median :680.0         Median :699.0         Median :1995-10-09     
##  Mean   :671.7         Mean   :690.7         Mean   :1994-11-30     
##  3rd Qu.:720.0         3rd Qu.:739.0         3rd Qu.:2000-01-01     
##  Max.   :880.0         Max.   :899.0         Max.   :2012-06-19     
##  NA's   :590           NA's   :590           NA's   :696            
##  CurrentCreditLines OpenCreditLines  TotalCreditLinespast7years
##  Min.   : 0.000     Min.   : 0.000   Min.   :  2.00            
##  1st Qu.: 6.000     1st Qu.: 5.000   1st Qu.: 15.00            
##  Median : 9.000     Median : 8.000   Median : 23.00            
##  Mean   : 9.567     Mean   : 8.336   Mean   : 25.28            
##  3rd Qu.:13.000     3rd Qu.:11.000   3rd Qu.: 33.00            
##  Max.   :59.000     Max.   :51.000   Max.   :136.00            
##  NA's   :7600       NA's   :7600     NA's   :696               
##  OpenRevolvingAccounts OpenRevolvingMonthlyPayment InquiriesLast6Months
##  Min.   : 0.000        Min.   :    0.0             Min.   :  0.000     
##  1st Qu.: 3.000        1st Qu.:   60.0             1st Qu.:  0.000     
##  Median : 5.000        Median :  183.0             Median :  1.000     
##  Mean   : 6.077        Mean   :  325.3             Mean   :  2.055     
##  3rd Qu.: 8.000        3rd Qu.:  418.0             3rd Qu.:  3.000     
##  Max.   :51.000        Max.   :14985.0             Max.   :105.000     
##                                                    NA's   :696         
##  TotalInquiries    CurrentDelinquencies AmountDelinquent
##  Min.   :  0.000   Min.   : 0.0000      Min.   :     0  
##  1st Qu.:  2.000   1st Qu.: 0.0000      1st Qu.:     0  
##  Median :  5.000   Median : 0.0000      Median :     0  
##  Mean   :  7.171   Mean   : 0.9077      Mean   :  1052  
##  3rd Qu.:  9.000   3rd Qu.: 1.0000      3rd Qu.:     0  
##  Max.   :379.000   Max.   :83.0000      Max.   :444745  
##  NA's   :1155      NA's   :696          NA's   :7618    
##  DelinquenciesLast7Years PublicRecordsLast10Years
##  Min.   : 0.000          Min.   : 0.0000         
##  1st Qu.: 0.000          1st Qu.: 0.0000         
##  Median : 0.000          Median : 0.0000         
##  Mean   : 4.576          Mean   : 0.3314         
##  3rd Qu.: 4.000          3rd Qu.: 0.0000         
##  Max.   :99.000          Max.   :30.0000         
##  NA's   :987             NA's   :696             
##  PublicRecordsLast12Months RevolvingCreditBalance BankcardUtilization
##  Min.   :0.000             Min.   :      0        Min.   :0.000      
##  1st Qu.:0.000             1st Qu.:   1613        1st Qu.:0.210      
##  Median :0.000             Median :   6071        Median :0.570      
##  Mean   :0.024             Mean   :  15700        Mean   :0.531      
##  3rd Qu.:0.000             3rd Qu.:  16169        3rd Qu.:0.850      
##  Max.   :7.000             Max.   :1435667        Max.   :5.950      
##  NA's   :7600              NA's   :7600           NA's   :7600       
##  AvailableBankcardCredit  TotalTrades     TradesNeverDelinquent.per
##  Min.   :     0          Min.   :  0.00   Min.   :0.000            
##  1st Qu.:   505          1st Qu.: 13.00   1st Qu.:0.770            
##  Median :  3236          Median : 20.00   Median :0.920            
##  Mean   : 10837          Mean   : 21.79   Mean   :0.856            
##  3rd Qu.: 12032          3rd Qu.: 29.00   3rd Qu.:1.000            
##  Max.   :646285          Max.   :126.00   Max.   :1.000            
##  NA's   :7540            NA's   :7540     NA's   :7540             
##  TradesOpenedLast6Months DebtToIncomeRatio         IncomeRange   
##  Min.   : 0.00           Min.   : 0.00     $25,000-49,999:16303  
##  1st Qu.: 0.00           1st Qu.: 0.13     $50,000-74,999:12744  
##  Median : 1.00           Median : 0.20     Not displayed : 7737  
##  Mean   : 0.91           Mean   : 0.29     $75,000-99,999: 6425  
##  3rd Qu.: 1.00           3rd Qu.: 0.30     $100,000+     : 6048  
##  Max.   :20.00           Max.   :10.01     $1-24,999     : 4558  
##  NA's   :7540            NA's   :4219      (Other)       : 1138  
##  IncomeVerifiable StatedMonthlyIncome                    LoanKey     
##  Mode :logical    Min.   :     0      08C43696561586194AC381C:    2  
##  FALSE:4289       1st Qu.:  2808      09303699897852595CD59DD:    2  
##  TRUE :50664      Median :  4167      114D37056655628721BD6C8:    2  
##                   Mean   :  5055      156836977849742636AE34F:    2  
##                   3rd Qu.:  6250      56D73700259224545E36FBC:    2  
##                   Max.   :618548      63113695530739927C7EA06:    2  
##                                       (Other)                :54941  
##  TotalProsperLoans TotalProsperPaymentsBilled OnTimeProsperPayments
##  Min.   :0.00      Min.   :  0.00             Min.   :  0.00       
##  1st Qu.:1.00      1st Qu.:  9.00             1st Qu.:  9.00       
##  Median :1.00      Median : 15.00             Median : 14.00       
##  Mean   :1.33      Mean   : 20.06             Mean   : 19.47       
##  3rd Qu.:1.00      3rd Qu.: 28.00             3rd Qu.: 27.00       
##  Max.   :7.00      Max.   :120.00             Max.   :114.00       
##  NA's   :44473     NA's   :44473              NA's   :44473        
##  ProsperPaymentsLessThanOneMonthLate ProsperPaymentsOneMonthPlusLate
##  Min.   : 0.00                       Min.   : 0.00                  
##  1st Qu.: 0.00                       1st Qu.: 0.00                  
##  Median : 0.00                       Median : 0.00                  
##  Mean   : 0.55                       Mean   : 0.05                  
##  3rd Qu.: 0.00                       3rd Qu.: 0.00                  
##  Max.   :42.00                       Max.   :21.00                  
##  NA's   :44473                       NA's   :44473                  
##  ProsperPrincipalBorrowed ProsperPrincipalOutstanding
##  Min.   :    0            Min.   :    0              
##  1st Qu.: 3000            1st Qu.:    0              
##  Median : 5000            Median : 1113              
##  Mean   : 7114            Mean   : 2343              
##  3rd Qu.: 9500            3rd Qu.: 3395              
##  Max.   :60001            Max.   :22587              
##  NA's   :44473            NA's   :44473              
##  ScorexChangeAtTimeOfListing LoanCurrentDaysDelinquent
##  Min.   :-194.00             Min.   :   0.0           
##  1st Qu.: -32.00             1st Qu.:   0.0           
##  Median :   0.00             Median :   0.0           
##  Mean   :   0.92             Mean   : 315.1           
##  3rd Qu.:  32.00             3rd Qu.: 230.0           
##  Max.   : 286.00             Max.   :2704.0           
##  NA's   :44568                                        
##  LoanFirstDefaultedCycleNumber LoanMonthsSinceOrigination   LoanNumber    
##  Min.   : 0.00                 Min.   :  1.00             Min.   :     1  
##  1st Qu.: 9.00                 1st Qu.: 30.00             1st Qu.: 18264  
##  Median :14.00                 Median : 66.00             Median : 36288  
##  Mean   :16.27                 Mean   : 55.64             Mean   : 37930  
##  3rd Qu.:22.00                 3rd Qu.: 79.00             3rd Qu.: 53290  
##  Max.   :44.00                 Max.   :100.00             Max.   :132453  
##  NA's   :38018                                                            
##  LoanOriginalAmount LoanOriginationDate  LoanOriginationQuarter
##  Min.   : 1000      Min.   :2005-11-15   Q2 2008: 4344         
##  1st Qu.: 2650      1st Qu.:2007-08-23   Q3 2008: 3602         
##  Median : 4500      Median :2008-09-04   Q2 2007: 3118         
##  Mean   : 6267      Mean   :2009-07-28   Q1 2007: 3079         
##  3rd Qu.: 8000      3rd Qu.:2011-09-14   Q1 2008: 3074         
##  Max.   :35000      Max.   :2014-02-21   (Other):37714         
##                                          NA's   :   22         
##                    MemberKey     MonthlyLoanPayment LP_CustomerPayments
##  16083364744933457E57FB9:    8   Min.   :   0.00    Min.   :   -2.35   
##  63CA34120866140639431C9:    8   1st Qu.:  98.57    1st Qu.: 2029.68   
##  739C338135235294782AE75:    8   Median : 172.60    Median : 4209.75   
##  7E1733653050264822FAA3D:    8   Mean   : 223.92    Mean   : 5930.22   
##  C70934206057523078260C7:    8   3rd Qu.: 300.67    3rd Qu.: 7939.28   
##  458E33818543661332BC1BE:    7   Max.   :2251.51    Max.   :40702.39   
##  (Other)                :54906                                         
##  LP_CustomerPrincipalPayments LP_InterestandFees LP_ServiceFees   
##  Min.   :    0                Min.   :   -2.35   Min.   :-664.87  
##  1st Qu.: 1351                1st Qu.:  331.35   1st Qu.: -73.06  
##  Median : 3159                Median :  764.41   Median : -34.25  
##  Mean   : 4802                Mean   : 1127.81   Mean   : -53.67  
##  3rd Qu.: 6500                3rd Qu.: 1510.31   3rd Qu.: -14.64  
##  Max.   :35000                Max.   :15617.03   Max.   :  32.06  
##                                                                   
##  LP_CollectionFees  LP_GrossPrincipalLoss LP_NetPrincipalLoss
##  Min.   :-9274.75   Min.   :  -94.2       Min.   : -954.5    
##  1st Qu.:    0.00   1st Qu.:    0.0       1st Qu.:    0.0    
##  Median :    0.00   Median :    0.0       Median :    0.0    
##  Mean   :  -25.06   Mean   : 1451.9       Mean   : 1412.5    
##  3rd Qu.:    0.00   3rd Qu.: 1446.9       3rd Qu.: 1312.3    
##  Max.   :    0.00   Max.   :25000.0       Max.   :25000.0    
##                                                              
##  LP_NonPrincipalRecoverypayments PercentFunded    Recommendations  
##  Min.   :    0.00                Min.   :0.7000   Min.   : 0.0000  
##  1st Qu.:    0.00                1st Qu.:1.0000   1st Qu.: 0.0000  
##  Median :    0.00                Median :1.0000   Median : 0.0000  
##  Mean   :   52.13                Mean   :0.9986   Mean   : 0.0894  
##  3rd Qu.:    0.00                3rd Qu.:1.0000   3rd Qu.: 0.0000  
##  Max.   :21117.90                Max.   :1.0110   Max.   :39.0000  
##                                                                    
##  InvestmentFromFriendsCount InvestmentFromFriendsAmount   Investors     
##  Min.   : 0.00000           Min.   :    0.00            Min.   :   1.0  
##  1st Qu.: 0.00000           1st Qu.:    0.00            1st Qu.:  31.0  
##  Median : 0.00000           Median :    0.00            Median :  69.0  
##  Mean   : 0.04613           Mean   :   33.53            Mean   : 104.8  
##  3rd Qu.: 0.00000           3rd Qu.:    0.00            3rd Qu.: 142.0  
##  Max.   :33.00000           Max.   :25000.00            Max.   :1189.0  
##                                                                         
##   PercentYield          Rating     
##  Min.   :-1.00092   D      :11022  
##  1st Qu.:-0.17136   C      : 9465  
##  Median : 0.10319   B      : 7762  
##  Mean   :-0.02488   HR     : 7230  
##  3rd Qu.: 0.22512   E      : 7119  
##  Max.   : 1.72976   A      : 6922  
##                     (Other): 5433

It’s not clear what distinguishes the NA Prosper ratings from others, whether this rating was assigned from the beginning, or why loans in this category are so profitable. These loans were all created in 2009 and 2010, however.

ProsperScore

LoanStatus
plot_data <- relevant_data %>% 
  mutate(LoanStatus = ordered(LoanStatus, c("Completed","Chargedoff","Defaulted")))

ggplot(plot_data, aes(x = LoanStatus, y = ProsperScore)) +
  geom_boxplot() +
  stat_summary(fun.data = mean_cl_normal) + 
  labs(title = "Loan Status by ProsperScore")

As can be seen, loans that were completed had the best risk score, although those that were charged off or defaulted are not markedly lower.

PercentYield
plot_data <- relevant_data

ggplot(plot_data, aes(x = factor(ProsperScore), y=PercentYield)) +
  geom_boxplot() +
  stat_summary(fun.data = mean_cl_normal) + 
  labs(title = "Percent Yield by ProsperScore")

Estimated Profit vs. Actual Profit

EstimatedEffectiveYield

LoanStatus
plot_data <- relevant_data %>% 
  mutate(LoanStatus = ordered(LoanStatus, c("Completed","Chargedoff","Defaulted")))

ggplot(plot_data, aes(x = LoanStatus, y = EstimatedEffectiveYield)) +
  geom_boxplot() +
  stat_summary(fun.data = mean_cl_normal) + 
  labs(title = "Loan Status by EstimatedEffectiveYield")

PercentYield
plot_data <- relevant_data

ggplot(plot_data, aes(x = EstimatedEffectiveYield, y=PercentYield)) +
  geom_hex(bins = 50) +
  geom_smooth(formula = y~x)

EstimatedLoss

LoanStatus
plot_data <- relevant_data %>% 
  mutate(LoanStatus = ordered(LoanStatus, c("Completed","Chargedoff","Defaulted")))

ggplot(plot_data, aes(x = LoanStatus, y = EstimatedLoss)) +
  geom_boxplot() +
  stat_summary(fun.data = mean_cl_normal) + 
  labs(title = "Loan Status by EstimatedLoss")

PercentYield
plot_data <- relevant_data

ggplot(plot_data, aes(x = EstimatedLoss, y=PercentYield)) +
  geom_hex(bins = 50) +
  geom_smooth(formula = y~x)

EstimatedReturn

LoanStatus
plot_data <- relevant_data %>% 
  mutate(LoanStatus = ordered(LoanStatus, c("Completed","Chargedoff","Defaulted")))

ggplot(plot_data, aes(x = LoanStatus, y = EstimatedReturn)) +
  geom_boxplot() +
  stat_summary(fun.data = mean_cl_normal)

PercentYield
plot_data <- relevant_data

ggplot(plot_data, aes(x = EstimatedReturn, y=PercentYield)) +
  geom_hex(bins = 50) +
  geom_smooth(formula = y~x)

Demographics vs. Actual Profit

Occupation

LoanStatus
plot_data <- relevant_data %>% 
  mutate(LoanStatus = ordered(LoanStatus, c("Completed","Chargedoff","Defaulted"))) %>%
  group_by(LoanStatus, Occupation) %>% 
  tally() %>%
  spread(LoanStatus, n) %>%
  replace_na(list(Completed=0, Chargedoff=0, Defaulted=0)) %>%
  mutate(percent = Completed/(Completed+Chargedoff+Defaulted)) %>%
  mutate(Occupation = reorder(Occupation, percent))

ggplot(plot_data, aes(x = Occupation, y=percent)) +
  geom_bar(stat="identity") +
  coord_flip() + 
  theme(axis.text.x = element_text(angle = 90, hjust = 1))

PercentYield
plot_data <- relevant_data %>% 
  group_by(Occupation) %>% 
  summarize(percent = mean(PercentYield, na.rm=TRUE)) %>%
  mutate(Occupation = reorder(Occupation, percent))

ggplot(plot_data, aes(x = Occupation, y=percent)) +
  geom_bar(stat="identity") +
  coord_flip() + 
  theme(axis.text.x = element_text(angle = 90, hjust = 1))

There are too many occupations to make easy generalizations. Occupations would likely need to be grouped into a smaller number of categories. However, one can observe general trends - those with higher-paying occupations (or occupational prospects) seem to be more profitable customers. On the other hand, students in general are among the bank’s least profitable customers. This suggests that income, which is grouped in a more sensible manner, may be useful to look at.

IncomeRange

LoanStatus
plot_data <- relevant_data %>% 
  mutate(LoanStatus = ordered(LoanStatus, c("Completed","Chargedoff","Defaulted"))) %>%
  group_by(LoanStatus, IncomeRange) %>% 
  tally() %>%
  spread(LoanStatus, n) %>%
  replace_na(list(Completed=0, Chargedoff=0, Defaulted=0)) %>%
  mutate(PercentCompleted = Completed/(Completed+Chargedoff+Defaulted)) %>%
  mutate(IncomeRange = reorder(IncomeRange, PercentCompleted))

ggplot(plot_data, aes(x = IncomeRange, y=PercentCompleted)) +
  geom_bar(stat="identity") +
  coord_flip() + 
  theme(axis.text.x = element_text(angle = 90, hjust = 1))

PercentYield
plot_data <- relevant_data

ggplot(plot_data, aes(x = IncomeRange, y=PercentYield)) +
  geom_boxplot() +
  stat_summary(fun.data = mean_cl_normal) + 
  coord_flip() + 
  theme(axis.text.x = element_text(angle = 90, hjust = 1))

Here, it can be seen that as income rises, the ProsperRating increases, and other measures of profit decrease. As we have seen, ProsperRating correlates with credit score and likelihood of not defaulting. This suggests that high-income lenders are lower-risk, but lower-income lenders, while being higher-risk, can also yield more profit.

EmploymentStatus

LoanStatus
plot_data <- relevant_data %>% 
  mutate(LoanStatus = ordered(LoanStatus, c("Completed","Chargedoff","Defaulted"))) %>%
  group_by(LoanStatus, EmploymentStatus) %>% 
  tally() %>%
  spread(LoanStatus, n) %>%
  replace_na(list(Completed=0, Chargedoff=0, Defaulted=0)) %>%
  mutate(PercentCompleted = Completed/(Completed+Chargedoff+Defaulted)) %>%
  mutate(EmploymentStatus = reorder(EmploymentStatus, PercentCompleted))

ggplot(plot_data, aes(x = EmploymentStatus, y=PercentCompleted)) +
  geom_bar(stat="identity") +
  coord_flip() + 
  theme(axis.text.x = element_text(angle = 90, hjust = 1))

PercentYield
plot_data <- relevant_data

ggplot(plot_data, aes(x = EmploymentStatus, y=PercentYield)) +
  geom_boxplot() +
  stat_summary(fun.data = mean_cl_normal) + 
  coord_flip() + 
  theme(axis.text.x = element_text(angle = 90, hjust = 1))

What it looks like here is that Prosper ratings are highest for those employed, and employed full-time (it’s not clear what the difference is), lower for those who are self-employed, retired, work part-time, or ‘other,’ and much lower for those not employed. LenderYield, EstimatedEffectiveYield, and EstimatedReturn, however, are highest for those not employed, likely reflecting the higher anticipated interest charged to people in that group. Estimated Loss, correspondingly, is also highest for those not employed - there’s higher potential profit if the loans are paid back, but also significantly more risk.

EmploymentStatusDuration

LoanStatus
plot_data <- relevant_data %>% 
  mutate(LoanStatus = ordered(LoanStatus, c("Completed","Chargedoff","Defaulted")))

ggplot(plot_data, aes(x = LoanStatus, y = EmploymentStatusDuration)) +
  geom_boxplot() +
  stat_summary(fun.data = mean_cl_normal)

PercentYield
plot_data <- relevant_data

ggplot(plot_data, aes(x = EmploymentStatusDuration, y=PercentYield)) +
  geom_hex(bins = 50) +
  geom_smooth(formula = y~x)

IsBorrowerHomeowner

LoanStatus
plot_data <- relevant_data %>% 
  mutate(LoanStatus = ordered(LoanStatus, c("Completed","Chargedoff","Defaulted"))) %>%
  group_by(LoanStatus) %>%
  summarise(percent = mean(IsBorrowerHomeowner, na.rm=TRUE))

ggplot(plot_data, aes(x = LoanStatus, y = percent, cumulative = TRUE)) +
    geom_col()

PercentYield
plot_data <- relevant_data

ggplot(plot_data, aes(x = IsBorrowerHomeowner, y=PercentYield)) +
  geom_boxplot() +
  stat_summary(fun.data = mean_cl_normal)

CreditScoreRangeLower

LoanStatus
plot_data <- relevant_data %>% 
  mutate(LoanStatus = ordered(LoanStatus, c("Completed","Chargedoff","Defaulted")))

ggplot(plot_data, aes(x = LoanStatus, y = CreditScoreRangeLower)) +
  geom_boxplot() +
  stat_summary(fun.data = mean_cl_normal)

PercentYield
plot_data <- relevant_data

ggplot(plot_data, aes(x = CreditScoreRangeLower, y=PercentYield)) +
  geom_hex(bins = 25) +
  geom_smooth(formula = y~x)

plot_data <- relevant_data %>% filter(CreditScoreRangeLower>400)

ggplot(plot_data, aes(x = CreditScoreRangeLower, y=PercentYield)) +
  geom_hex(bins = 25) +
  geom_smooth(formula = y~x)

CreditScoreRangeUpper

LoanStatus
plot_data <- relevant_data %>% 
  mutate(LoanStatus = ordered(LoanStatus, c("Completed","Chargedoff","Defaulted")))

ggplot(plot_data, aes(x = LoanStatus, y = CreditScoreRangeUpper)) +
  geom_boxplot() +
  stat_summary(fun.data = mean_cl_normal)

PercentYield
plot_data <- relevant_data

ggplot(plot_data, aes(x = CreditScoreRangeUpper, y=PercentYield)) +
  geom_hex(bins = 25) +
  geom_smooth(formula = y~x)

plot_data <- relevant_data %>% filter(CreditScoreRangeUpper>400)

ggplot(plot_data, aes(x = CreditScoreRangeUpper, y=PercentYield)) +
  geom_hex(bins = 25) +
  geom_smooth(formula = y~x)

FirstRecordedCreditLine

LoanStatus
plot_data <- relevant_data %>% 
  mutate(LoanStatus = ordered(LoanStatus, c("Completed","Chargedoff","Defaulted")))

ggplot(plot_data, aes(x = LoanStatus, y = FirstRecordedCreditLine)) +
  geom_boxplot() +
  stat_summary(fun.data = mean_cl_normal)

PercentYield
plot_data <- relevant_data

ggplot(plot_data, aes(x = FirstRecordedCreditLine, y=PercentYield)) +
  geom_hex(bins = 50) +
  geom_smooth(formula = y~x)

OpenRevolvingAccounts

LoanStatus
plot_data <- relevant_data %>% 
  mutate(LoanStatus = ordered(LoanStatus, c("Completed","Chargedoff","Defaulted")))

ggplot(plot_data, aes(x = LoanStatus, y = OpenRevolvingAccounts)) +
  geom_boxplot() +
  stat_summary(fun.data = mean_cl_normal)

PercentYield
plot_data <- relevant_data

ggplot(plot_data, aes(x = OpenRevolvingAccounts, y=PercentYield)) +
  geom_hex(bins = 50) +
  geom_smooth(formula = y~x)

InquiriesLast6Months

LoanStatus
plot_data <- relevant_data %>% 
  mutate(LoanStatus = ordered(LoanStatus, c("Completed","Chargedoff","Defaulted")))

ggplot(plot_data, aes(x = LoanStatus, y = InquiriesLast6Months)) +
  geom_boxplot() +
  stat_summary(fun.data = mean_cl_normal)

PercentYield
plot_data <- relevant_data

ggplot(plot_data, aes(x = InquiriesLast6Months, y=PercentYield)) +
  geom_hex(bins = 50) +
  geom_smooth(formula = y~x)

AmountDelinquent

LoanStatus
plot_data <- relevant_data %>% 
  mutate(LoanStatus = ordered(LoanStatus, c("Completed","Chargedoff","Defaulted")))

ggplot(plot_data, aes(x = LoanStatus, y = AmountDelinquent)) +
  geom_boxplot() +
  stat_summary(fun.data = mean_cl_normal)

PercentYield
plot_data <- relevant_data

ggplot(plot_data, aes(x = AmountDelinquent, y=PercentYield)) +
  geom_hex(bins = 50) +
  geom_smooth(formula = y~x)

DelinquenciesLast7Years

LoanStatus
plot_data <- relevant_data %>% 
  mutate(LoanStatus = ordered(LoanStatus, c("Completed","Chargedoff","Defaulted")))

ggplot(plot_data, aes(x = LoanStatus, y = DelinquenciesLast7Years)) +
  geom_boxplot() +
  stat_summary(fun.data = mean_cl_normal)

PercentYield
plot_data <- relevant_data

ggplot(plot_data, aes(x = DelinquenciesLast7Years, y=PercentYield)) +
  geom_hex(bins = 50) +
  geom_smooth(formula = y~x)

RevolvingCreditBalance

LoanStatus
plot_data <- relevant_data %>% 
  mutate(LoanStatus = ordered(LoanStatus, c("Completed","Chargedoff","Defaulted")))

ggplot(plot_data, aes(x = LoanStatus, y = RevolvingCreditBalance)) +
  geom_boxplot() +
  stat_summary(fun.data = mean_cl_normal)

PercentYield
plot_data <- relevant_data

ggplot(plot_data, aes(x = RevolvingCreditBalance, y=PercentYield)) +
  geom_hex(bins = 50) +
  geom_smooth(formula = y~x)

BankcardUtilization

LoanStatus
plot_data <- relevant_data %>% 
  mutate(LoanStatus = ordered(LoanStatus, c("Completed","Chargedoff","Defaulted")))

ggplot(plot_data, aes(x = LoanStatus, y = BankcardUtilization)) +
  geom_boxplot() +
  stat_summary(fun.data = mean_cl_normal)

PercentYield
plot_data <- relevant_data

ggplot(plot_data, aes(x = BankcardUtilization, y=PercentYield)) +
  geom_hex(bins = 50) +
  geom_smooth(formula = y~x)

DebtToIncomeRatio

LoanStatus
plot_data <- relevant_data %>% 
  mutate(LoanStatus = ordered(LoanStatus, c("Completed","Chargedoff","Defaulted")))

ggplot(plot_data, aes(x = LoanStatus, y = DebtToIncomeRatio)) +
  geom_boxplot() +
  stat_summary(fun.data = mean_cl_normal)

PercentYield
plot_data <- relevant_data

ggplot(plot_data, aes(x = DebtToIncomeRatio, y=PercentYield)) +
  geom_hex(bins = 50) +
  geom_smooth(formula = y~x)

IncomeVerifiable

LoanStatus
plot_data <- relevant_data %>% 
  mutate(LoanStatus = ordered(LoanStatus, c("Completed","Chargedoff","Defaulted"))) %>%
  group_by(LoanStatus) %>%
  summarise(percent = mean(IncomeVerifiable, na.rm=TRUE))

ggplot(plot_data, aes(x = LoanStatus, y = percent, cumulative = TRUE)) +
    geom_col()

PercentYield
plot_data <- relevant_data

ggplot(plot_data, aes(x = IncomeVerifiable, y=PercentYield)) +
  geom_boxplot() +
  stat_summary(fun.data = mean_cl_normal)

TotalTrades

LoanStatus
plot_data <- relevant_data %>% 
  mutate(LoanStatus = ordered(LoanStatus, c("Completed","Chargedoff","Defaulted")))

ggplot(plot_data, aes(x = LoanStatus, y = TotalTrades)) +
  geom_boxplot() +
  stat_summary(fun.data = mean_cl_normal)

PercentYield
plot_data <- relevant_data

ggplot(plot_data, aes(x = TotalTrades, y=PercentYield)) +
  geom_hex(bins = 50) +
  geom_smooth(formula = y~x)

TradesNeverDelinquent

LoanStatus
plot_data <- relevant_data %>% 
  mutate(LoanStatus = ordered(LoanStatus, c("Completed","Chargedoff","Defaulted")))

ggplot(plot_data, aes(x = LoanStatus, y = TradesNeverDelinquent.per)) +
  geom_boxplot() +
  stat_summary(fun.data = mean_cl_normal)

PercentYield
plot_data <- relevant_data

ggplot(plot_data, aes(x = TradesNeverDelinquent.per, y=PercentYield)) +
  geom_hex(bins = 50) +
  geom_smooth(formula = y~x)

Other Prosper Data vs. Actual Profit

TotalProsperLoans

LoanStatus
plot_data <- relevant_data %>% 
  mutate(LoanStatus = ordered(LoanStatus, c("Completed","Chargedoff","Defaulted")))

ggplot(plot_data, aes(x = LoanStatus, y = TotalProsperLoans)) +
  geom_boxplot() +
  stat_summary(fun.data = mean_cl_normal)

PercentYield
plot_data <- relevant_data

ggplot(plot_data, aes(x = factor(TotalProsperLoans), y=PercentYield)) +
  geom_boxplot() +
  stat_summary(fun.data = mean_cl_normal)

OnTimeProsperPayments

LoanStatus
plot_data <- relevant_data %>% 
  mutate(LoanStatus = ordered(LoanStatus, c("Completed","Chargedoff","Defaulted")))

ggplot(plot_data, aes(x = LoanStatus, y = OnTimeProsperPayments)) +
  geom_boxplot() +
  stat_summary(fun.data = mean_cl_normal)

PercentYield
plot_data <- relevant_data

ggplot(plot_data, aes(x = OnTimeProsperPayments, y=PercentYield)) +
  geom_hex(bins = 50) +
  geom_smooth(formula = y~x)

ProsperPrincipalOutstanding

LoanStatus
plot_data <- relevant_data %>% 
  mutate(LoanStatus = ordered(LoanStatus, c("Completed","Chargedoff","Defaulted")))

ggplot(plot_data, aes(x = LoanStatus, y = ProsperPrincipalOutstanding)) +
  geom_boxplot() +
  stat_summary(fun.data = mean_cl_normal)

PercentYield
plot_data <- relevant_data

ggplot(plot_data, aes(x = ProsperPrincipalOutstanding, y=PercentYield)) +
  geom_hex(bins = 50) +
  geom_smooth(formula = y~x)

Recommendations

LoanStatus
plot_data <- relevant_data %>% 
  mutate(LoanStatus = ordered(LoanStatus, c("Completed","Chargedoff","Defaulted")))

ggplot(plot_data, aes(x = LoanStatus, y = Recommendations)) +
  geom_boxplot() +
  stat_summary(fun.data = mean_cl_normal)

PercentYield
plot_data <- relevant_data

ggplot(plot_data, aes(x = Recommendations, y=PercentYield)) +
  geom_hex(bins = 50) +
  geom_smooth(formula = y~x)

InvestmentFromFriendsCount

LoanStatus
plot_data <- relevant_data %>% 
  mutate(LoanStatus = ordered(LoanStatus, c("Completed","Chargedoff","Defaulted")))

ggplot(plot_data, aes(x = LoanStatus, y = InvestmentFromFriendsCount)) +
  geom_boxplot() +
  stat_summary(fun.data = mean_cl_normal)

PercentYield
plot_data <- relevant_data

ggplot(plot_data, aes(x = InvestmentFromFriendsCount, y=PercentYield)) +
  geom_hex(bins = 50) +
  geom_smooth(formula = y~x)

InvestmentFromFriendsAmount

LoanStatus
plot_data <- relevant_data %>% 
  mutate(LoanStatus = ordered(LoanStatus, c("Completed","Chargedoff","Defaulted")))

ggplot(plot_data, aes(x = LoanStatus, y = InvestmentFromFriendsAmount)) +
  geom_boxplot() +
  stat_summary(fun.data = mean_cl_normal)

PercentYield
plot_data <- relevant_data

ggplot(plot_data, aes(x = InvestmentFromFriendsAmount, y=PercentYield)) +
  geom_hex(bins = 50) +
  geom_smooth(formula = y~x)

LoanOriginationQuarter

summary(filter(relevant_data, is.na(LoanOriginationQuarter)))
##                    ListingKey ListingNumber   ListingCreationDate 
##  044B3365298516680DA929B: 1   Min.   : 4.00   Min.   :2005-11-09  
##  0B4133652604109810CAA3B: 1   1st Qu.:18.25   1st Qu.:2005-11-18  
##  0E0F336443449038617E9F4: 1   Median :23.50   Median :2005-11-20  
##  2F123364529418907A58D4C: 1   Mean   :26.05   Mean   :2005-11-22  
##  2F25336514614362295DA03: 1   3rd Qu.:35.75   3rd Qu.:2005-11-28  
##  3480336511078238810A782: 1   Max.   :59.00   Max.   :2005-12-21  
##  (Other)                :16                                       
##   CreditGrade      Term                     LoanStatus
##  AA     :12   Min.   :36   Completed             :22  
##  HR     : 2   1st Qu.:36   Cancelled             : 0  
##  C      : 2   Median :36   Chargedoff            : 0  
##  B      : 2   Mean   :36   Current               : 0  
##  NC     : 1   3rd Qu.:36   Defaulted             : 0  
##  E      : 1   Max.   :36   FinalPaymentInProgress: 0  
##  (Other): 2                (Other)               : 0  
##    ClosedDate          BorrowerAPR   BorrowerRate      LenderYield     
##  Min.   :2005-11-25   Min.   : NA   Min.   :0.04000   Min.   :0.03500  
##  1st Qu.:2005-12-02   1st Qu.: NA   1st Qu.:0.06099   1st Qu.:0.05500  
##  Median :2006-01-06   Median : NA   Median :0.08500   Median :0.07500  
##  Mean   :2006-08-15   Mean   :NaN   Mean   :0.09370   Mean   :0.08308  
##  3rd Qu.:2006-08-30   3rd Qu.: NA   3rd Qu.:0.11500   3rd Qu.:0.09500  
##  Max.   :2008-12-30   Max.   : NA   Max.   :0.25000   Max.   :0.24500  
##                       NA's   :22                                       
##  EstimatedEffectiveYield EstimatedLoss EstimatedReturn ProsperRating.num
##  Min.   : NA             Min.   : NA   Min.   : NA     Min.   : NA      
##  1st Qu.: NA             1st Qu.: NA   1st Qu.: NA     1st Qu.: NA      
##  Median : NA             Median : NA   Median : NA     Median : NA      
##  Mean   :NaN             Mean   :NaN   Mean   :NaN     Mean   :NaN      
##  3rd Qu.: NA             3rd Qu.: NA   3rd Qu.: NA     3rd Qu.: NA      
##  Max.   : NA             Max.   : NA   Max.   : NA     Max.   : NA      
##  NA's   :22              NA's   :22    NA's   :22      NA's   :22       
##  ProsperRating.alpha  ProsperScore ListingCategory.num BorrowerState
##  NC     : 0          Min.   : NA   0      :22          AK     : 0   
##  HR     : 0          1st Qu.: NA   1      : 0          AL     : 0   
##  E      : 0          Median : NA   2      : 0          AR     : 0   
##  D      : 0          Mean   :NaN   3      : 0          AZ     : 0   
##  C      : 0          3rd Qu.: NA   4      : 0          CA     : 0   
##  (Other): 0          Max.   : NA   5      : 0          (Other): 0   
##  NA's   :22          NA's   :22    (Other): 0          NA's   :22   
##                     Occupation      EmploymentStatus
##  Accountant/CPA          : 0   Employed     : 0     
##  Administrative Assistant: 0   Full-time    : 0     
##  Analyst                 : 0   Not available: 0     
##  Architect               : 0   Not employed : 0     
##  Attorney                : 0   Other        : 0     
##  (Other)                 : 0   (Other)      : 0     
##  NA's                    :22   NA's         :22     
##  EmploymentStatusDuration IsBorrowerHomeowner CurrentlyInGroup
##  Min.   : NA              Mode :logical       Mode :logical   
##  1st Qu.: NA              FALSE:22            FALSE:22        
##  Median : NA                                                  
##  Mean   :NaN                                                  
##  3rd Qu.: NA                                                  
##  Max.   : NA                                                  
##  NA's   :22                                                   
##                     GroupKey DateCreditPulled     CreditScoreRangeLower
##  B8143364846229046768A83:4   Min.   :2005-11-09   Min.   : NA          
##  12D7336581480170815332C:2   1st Qu.:2005-11-16   1st Qu.: NA          
##  5BE63365249159793785758:2   Median :2005-11-18   Median : NA          
##  F0B53365823807576457B84:2   Mean   :2005-11-21   Mean   :NaN          
##  94E9336577086235891524E:1   3rd Qu.:2005-11-28   3rd Qu.: NA          
##  (Other)                :4   Max.   :2005-12-20   Max.   : NA          
##  NA's                   :7                        NA's   :22           
##  CreditScoreRangeUpper FirstRecordedCreditLine CurrentCreditLines
##  Min.   : NA           Min.   :NA              Min.   : NA       
##  1st Qu.: NA           1st Qu.:NA              1st Qu.: NA       
##  Median : NA           Median :NA              Median : NA       
##  Mean   :NaN           Mean   :NA              Mean   :NaN       
##  3rd Qu.: NA           3rd Qu.:NA              3rd Qu.: NA       
##  Max.   : NA           Max.   :NA              Max.   : NA       
##  NA's   :22            NA's   :22              NA's   :22        
##  OpenCreditLines TotalCreditLinespast7years OpenRevolvingAccounts
##  Min.   : NA     Min.   : NA                Min.   :0            
##  1st Qu.: NA     1st Qu.: NA                1st Qu.:0            
##  Median : NA     Median : NA                Median :0            
##  Mean   :NaN     Mean   :NaN                Mean   :0            
##  3rd Qu.: NA     3rd Qu.: NA                3rd Qu.:0            
##  Max.   : NA     Max.   : NA                Max.   :0            
##  NA's   :22      NA's   :22                                      
##  OpenRevolvingMonthlyPayment InquiriesLast6Months TotalInquiries
##  Min.   :0                   Min.   : NA          Min.   : NA   
##  1st Qu.:0                   1st Qu.: NA          1st Qu.: NA   
##  Median :0                   Median : NA          Median : NA   
##  Mean   :0                   Mean   :NaN          Mean   :NaN   
##  3rd Qu.:0                   3rd Qu.: NA          3rd Qu.: NA   
##  Max.   :0                   Max.   : NA          Max.   : NA   
##                              NA's   :22           NA's   :22    
##  CurrentDelinquencies AmountDelinquent DelinquenciesLast7Years
##  Min.   : NA          Min.   : NA      Min.   : NA            
##  1st Qu.: NA          1st Qu.: NA      1st Qu.: NA            
##  Median : NA          Median : NA      Median : NA            
##  Mean   :NaN          Mean   :NaN      Mean   :NaN            
##  3rd Qu.: NA          3rd Qu.: NA      3rd Qu.: NA            
##  Max.   : NA          Max.   : NA      Max.   : NA            
##  NA's   :22           NA's   :22       NA's   :22             
##  PublicRecordsLast10Years PublicRecordsLast12Months RevolvingCreditBalance
##  Min.   : NA              Min.   : NA               Min.   : NA           
##  1st Qu.: NA              1st Qu.: NA               1st Qu.: NA           
##  Median : NA              Median : NA               Median : NA           
##  Mean   :NaN              Mean   :NaN               Mean   :NaN           
##  3rd Qu.: NA              3rd Qu.: NA               3rd Qu.: NA           
##  Max.   : NA              Max.   : NA               Max.   : NA           
##  NA's   :22               NA's   :22                NA's   :22            
##  BankcardUtilization AvailableBankcardCredit  TotalTrades 
##  Min.   : NA         Min.   : NA             Min.   : NA  
##  1st Qu.: NA         1st Qu.: NA             1st Qu.: NA  
##  Median : NA         Median : NA             Median : NA  
##  Mean   :NaN         Mean   :NaN             Mean   :NaN  
##  3rd Qu.: NA         3rd Qu.: NA             3rd Qu.: NA  
##  Max.   : NA         Max.   : NA             Max.   : NA  
##  NA's   :22          NA's   :22              NA's   :22   
##  TradesNeverDelinquent.per TradesOpenedLast6Months DebtToIncomeRatio
##  Min.   : NA               Min.   : NA             Min.   :0.01051  
##  1st Qu.: NA               1st Qu.: NA             1st Qu.:0.01569  
##  Median : NA               Median : NA             Median :0.02714  
##  Mean   :NaN               Mean   :NaN             Mean   :0.07178  
##  3rd Qu.: NA               3rd Qu.: NA             3rd Qu.:0.07534  
##  Max.   : NA               Max.   : NA             Max.   :0.38380  
##  NA's   :22                NA's   :22                               
##          IncomeRange IncomeVerifiable StatedMonthlyIncome
##  Not displayed :22   Mode:logical     Min.   : 1083      
##  Not employed  : 0   TRUE:22          1st Qu.: 6771      
##  $0            : 0                    Median : 9312      
##  $1-24,999     : 0                    Mean   :11123      
##  $25,000-49,999: 0                    3rd Qu.:14062      
##  $50,000-74,999: 0                    Max.   :29167      
##  (Other)       : 0                                       
##                     LoanKey   TotalProsperLoans TotalProsperPaymentsBilled
##  051C3366339161583A81E4D: 1   Min.   : NA       Min.   : NA               
##  11463365963100969351D1D: 1   1st Qu.: NA       1st Qu.: NA               
##  30FD3365652573455326F15: 1   Median : NA       Median : NA               
##  31AC3364816494648054FCB: 1   Mean   :NaN       Mean   :NaN               
##  32233364725508802D1C433: 1   3rd Qu.: NA       3rd Qu.: NA               
##  335E3365194260894C5E804: 1   Max.   : NA       Max.   : NA               
##  (Other)                :16   NA's   :22        NA's   :22                
##  OnTimeProsperPayments ProsperPaymentsLessThanOneMonthLate
##  Min.   : NA           Min.   : NA                        
##  1st Qu.: NA           1st Qu.: NA                        
##  Median : NA           Median : NA                        
##  Mean   :NaN           Mean   :NaN                        
##  3rd Qu.: NA           3rd Qu.: NA                        
##  Max.   : NA           Max.   : NA                        
##  NA's   :22            NA's   :22                         
##  ProsperPaymentsOneMonthPlusLate ProsperPrincipalBorrowed
##  Min.   : NA                     Min.   : NA             
##  1st Qu.: NA                     1st Qu.: NA             
##  Median : NA                     Median : NA             
##  Mean   :NaN                     Mean   :NaN             
##  3rd Qu.: NA                     3rd Qu.: NA             
##  Max.   : NA                     Max.   : NA             
##  NA's   :22                      NA's   :22              
##  ProsperPrincipalOutstanding ScorexChangeAtTimeOfListing
##  Min.   : NA                 Min.   : NA                
##  1st Qu.: NA                 1st Qu.: NA                
##  Median : NA                 Median : NA                
##  Mean   :NaN                 Mean   :NaN                
##  3rd Qu.: NA                 3rd Qu.: NA                
##  Max.   : NA                 Max.   : NA                
##  NA's   :22                  NA's   :22                 
##  LoanCurrentDaysDelinquent LoanFirstDefaultedCycleNumber
##  Min.   :0                 Min.   : NA                  
##  1st Qu.:0                 1st Qu.: NA                  
##  Median :0                 Median : NA                  
##  Mean   :0                 Mean   :NaN                  
##  3rd Qu.:0                 3rd Qu.: NA                  
##  Max.   :0                 Max.   : NA                  
##                            NA's   :22                   
##  LoanMonthsSinceOrigination   LoanNumber    LoanOriginalAmount
##  Min.   : 99.00             Min.   : 1.00   Min.   : 1000     
##  1st Qu.: 99.00             1st Qu.: 6.25   1st Qu.: 1500     
##  Median :100.00             Median :11.50   Median : 3000     
##  Mean   : 99.59             Mean   :11.50   Mean   : 3577     
##  3rd Qu.:100.00             3rd Qu.:16.75   3rd Qu.: 4150     
##  Max.   :100.00             Max.   :22.00   Max.   :15000     
##                                                               
##  LoanOriginationDate  LoanOriginationQuarter                   MemberKey 
##  Min.   :2005-11-15   Q1 2006: 0             D3123364665672102D89C63: 2  
##  1st Qu.:2005-11-25   Q2 2006: 0             0A8633658381202043D0226: 1  
##  Median :2005-11-28   Q3 2006: 0             0FE0336637558007610834C: 1  
##  Mean   :2005-12-01   Q4 2006: 0             10983364491040266AF6111: 1  
##  3rd Qu.:2005-12-07   Q1 2007: 0             12C53364471219226F478E8: 1  
##  Max.   :2005-12-30   (Other): 0             4C9A3364566879406D66E65: 1  
##                       NA's   :22             (Other)                :15  
##  MonthlyLoanPayment LP_CustomerPayments LP_CustomerPrincipalPayments
##  Min.   :  0.00     Min.   : 1000       Min.   : 1000               
##  1st Qu.: 33.82     1st Qu.: 1517       1st Qu.: 1500               
##  Median : 85.27     Median : 3006       Median : 3000               
##  Mean   :102.43     Mean   : 3772       Mean   : 3577               
##  3rd Qu.:121.65     3rd Qu.: 4586       3rd Qu.: 4150               
##  Max.   :498.21     Max.   :16446       Max.   :15000               
##                                                                     
##  LP_InterestandFees LP_ServiceFees    LP_CollectionFees 
##  Min.   :   0.330   Min.   :-69.170   Min.   :-123.323  
##  1st Qu.:   1.725   1st Qu.: -6.713   1st Qu.:   0.000  
##  Median :  18.085   Median : -1.490   Median :   0.000  
##  Mean   : 195.287   Mean   : -9.947   Mean   :  -5.606  
##  3rd Qu.:  90.547   3rd Qu.: -0.880   3rd Qu.:   0.000  
##  Max.   :1445.530   Max.   : -0.330   Max.   :   0.000  
##                                                         
##  LP_GrossPrincipalLoss LP_NetPrincipalLoss LP_NonPrincipalRecoverypayments
##  Min.   :0             Min.   :0           Min.   :0                      
##  1st Qu.:0             1st Qu.:0           1st Qu.:0                      
##  Median :0             Median :0           Median :0                      
##  Mean   :0             Mean   :0           Mean   :0                      
##  3rd Qu.:0             3rd Qu.:0           3rd Qu.:0                      
##  Max.   :0             Max.   :0           Max.   :0                      
##                                                                           
##  PercentFunded   Recommendations InvestmentFromFriendsCount
##  Min.   :1.000   Min.   :0       Min.   :0                 
##  1st Qu.:1.000   1st Qu.:0       1st Qu.:0                 
##  Median :1.000   Median :0       Median :0                 
##  Mean   :1.000   Mean   :0       Mean   :0                 
##  3rd Qu.:1.000   3rd Qu.:0       3rd Qu.:0                 
##  Max.   :1.011   Max.   :0       Max.   :0                 
##                                                            
##  InvestmentFromFriendsAmount   Investors       PercentYield     
##  Min.   :0                   Min.   : 1.000   Min.   :0.000000  
##  1st Qu.:0                   1st Qu.: 3.000   1st Qu.:0.000258  
##  Median :0                   Median : 5.000   Median :0.007464  
##  Mean   :0                   Mean   : 5.045   Mean   :0.031348  
##  3rd Qu.:0                   3rd Qu.: 6.000   3rd Qu.:0.054270  
##  Max.   :0                   Max.   :14.000   Max.   :0.139049  
## 
relevant_data$LoanOriginationQuarter <- fct_explicit_na(relevant_data$LoanOriginationQuarter, "Q4 2005")
LoanStatus
plot_data <- relevant_data %>% 
  mutate(LoanStatus = ordered(LoanStatus, c("Completed","Chargedoff","Defaulted"))) %>%
  group_by(LoanOriginationQuarter, LoanStatus) %>%
  summarize(n=n()) %>%
  spread(LoanStatus,n) %>%
  replace_na(list(Completed=0, Chargedoff=0, Defaulted=0)) %>%
  summarize(percent = Completed/(Completed+Chargedoff+Defaulted))

ggplot(plot_data, aes(x = LoanOriginationQuarter, y = percent)) +
  geom_col() +
  stat_summary(fun.data = mean_cl_normal) +
  coord_flip()

PercentYield
plot_data <- relevant_data %>% 
  group_by(LoanOriginationQuarter) %>% 
  summarize(PercentYield = mean(PercentYield, na.rm = TRUE))

ggplot(plot_data, aes(x = LoanOriginationQuarter, y=PercentYield)) +
  geom_bar(stat = "identity") +
  coord_flip() + 
  theme(axis.text.x = element_text(angle = 90, hjust = 1))

Ultimately, without knowing how the loans ultimately panned out, it is a bit difficult to use this data to make future predictions.

Borrower Profit

Here, I will look at BorrowerAPR (lower is better), BorrowerRate (lower is better), LoanOriginalAmount (very roughly assuming that getting larger loans is somewhat preferable, though this is likely confounded by the borrower’s needs and financial situation), MonthlyLoanPayment (lower is better), Term (roughly assuming that longer is better), and PercentFunded (more is better).

relevant_data <- data %>%
  mutate(PercentYield = ((LoanOriginalAmount-LP_NetPrincipalLoss+LP_ServiceFees+LP_CollectionFees+LP_NonPrincipalRecoverypayments+LP_InterestandFees)/LoanOriginalAmount)-1)

Scores vs. Actual Profit

Rating

plot_data <- relevant_data %>% 
  mutate(Rating = coalesce(CreditGrade, ProsperRating.alpha)) %>%
  select(Rating,BorrowerAPR,BorrowerRate,LoanOriginalAmount,MonthlyLoanPayment,Term,PercentFunded) %>%
  gather(Measure, value, -Rating)

ggplot(plot_data, aes(x = Rating, y=value)) +
  geom_col() + 
  facet_grid(Measure ~ ., scales="free")

ProsperScore

plot_data <- relevant_data %>%
  select(ProsperScore,BorrowerAPR,BorrowerRate,LoanOriginalAmount,MonthlyLoanPayment,Term,PercentFunded) %>%
  gather(Measure, value, -ProsperScore)

ggplot(plot_data, aes(x = factor(ProsperScore), y=value)) +
  geom_col() + 
  facet_grid(Measure ~ ., scales="free")

Estimated Profit vs. Actual Profit

EstimatedEffectiveYield

plot_data <- relevant_data %>%
  select(EstimatedEffectiveYield,BorrowerAPR,BorrowerRate,LoanOriginalAmount,MonthlyLoanPayment,Term,PercentFunded) %>%
  gather(Measure, value, -EstimatedEffectiveYield)

ggplot(plot_data, aes(x = EstimatedEffectiveYield, y=value)) +
  stat_binhex() +
  geom_smooth(formula = y~x) + 
  facet_wrap(~Measure, scales="free")

EstimatedLoss

plot_data <- relevant_data %>%
  select(EstimatedLoss,BorrowerAPR,BorrowerRate,LoanOriginalAmount,MonthlyLoanPayment,Term,PercentFunded) %>%
  gather(Measure, value, -EstimatedLoss)

ggplot(plot_data, aes(x = EstimatedLoss, y=value)) +
  stat_binhex() +
  geom_smooth(formula = y~x) + 
  facet_wrap(~Measure, scales="free")

EstimatedReturn

plot_data <- relevant_data %>%
  select(EstimatedReturn,BorrowerAPR,BorrowerRate,LoanOriginalAmount,MonthlyLoanPayment,Term,PercentFunded) %>%
  gather(Measure, value, -EstimatedReturn)

ggplot(plot_data, aes(x = EstimatedReturn, y=value)) +
  stat_binhex() +
  geom_smooth(formula = y~x) + 
  facet_wrap(~Measure, scales="free")

Demographics vs. Actual Profit

Occupation

plot_data <- relevant_data %>% 
  select(Occupation,BorrowerAPR,BorrowerRate,LoanOriginalAmount,MonthlyLoanPayment,Term,PercentFunded) %>%
  # mutate(Occupation = reorder(Occupation, BorrowerRate)) %>%
  gather(Measure, value, -Occupation)

ggplot(plot_data, aes(x = Occupation, y=value)) +
  geom_col() +
  facet_grid(Measure ~ ., scales="free")

There are too many occupations to make easy generalizations. Occupations would likely need to be grouped into a smaller number of categories. However, one can observe general trends - those with higher-paying occupations (or occupational prospects) seem to be more profitable customers. On the other hand, students in general are among the bank’s least profitable customers. This suggests that income, which is grouped in a more sensible manner, may be useful to look at.

IncomeRange

plot_data <- relevant_data %>%
  select(IncomeRange,BorrowerAPR,BorrowerRate,LoanOriginalAmount,MonthlyLoanPayment,Term,PercentFunded) %>%
  gather(Measure, value, -IncomeRange)

ggplot(plot_data, aes(x = IncomeRange, y=value)) +
  geom_boxplot() +
  stat_summary(fun.data = mean_cl_normal) + 
  coord_flip() + 
  theme(axis.text.x = element_text(angle = 90, hjust = 1)) + 
  facet_grid(Measure ~ ., scales="free")

Here, it can be seen that as income rises, the ProsperRating increases, and other measures of profit decrease. As we have seen, ProsperRating correlates with credit score and likelihood of not defaulting. This suggests that high-income lenders are lower-risk, but lower-income lenders, while being higher-risk, can also yield more profit.

EmploymentStatus

plot_data <- relevant_data %>%
  select(EmploymentStatus,BorrowerAPR,BorrowerRate,LoanOriginalAmount,MonthlyLoanPayment,Term,PercentFunded) %>%
  gather(Measure, value, -EmploymentStatus)

ggplot(plot_data, aes(x = EmploymentStatus, y=value)) +
  geom_boxplot() +
  stat_summary(fun.data = mean_cl_normal) + 
  coord_flip() + 
  theme(axis.text.x = element_text(angle = 90, hjust = 1)) + 
  facet_grid(Measure ~ ., scales="free")

What it looks like here is that Prosper ratings are highest for those employed, and employed full-time (it’s not clear what the difference is), lower for those who are self-employed, retired, work part-time, or ‘other,’ and much lower for those not employed. LenderYield, EstimatedEffectiveYield, and EstimatedReturn, however, are highest for those not employed, likely reflecting the higher anticipated interest charged to people in that group. Estimated Loss, correspondingly, is also highest for those not employed - there’s higher potential profit if the loans are paid back, but also significantly more risk.

EmploymentStatusDuration

plot_data <- relevant_data %>%
  select(EmploymentStatusDuration,BorrowerAPR,BorrowerRate,LoanOriginalAmount,MonthlyLoanPayment,Term,PercentFunded) %>%
  gather(Measure, value, -EmploymentStatusDuration)

ggplot(plot_data, aes(x = EmploymentStatusDuration, y=value)) +
  stat_binhex() +
  geom_smooth(formula = y~x) + 
  facet_wrap(~Measure, scales="free")

IsBorrowerHomeowner

plot_data <- relevant_data %>%
  select(IsBorrowerHomeowner,BorrowerAPR,BorrowerRate,LoanOriginalAmount,MonthlyLoanPayment,Term,PercentFunded) %>%
  gather(Measure, value, -IsBorrowerHomeowner)

ggplot(plot_data, aes(x = IsBorrowerHomeowner, y=value)) +
  geom_boxplot() +
  stat_summary(fun.data = mean_cl_normal) + 
  facet_grid(Measure ~ ., scales="free")

CreditScoreRangeLower

plot_data <- relevant_data %>%
  select(CreditScoreRangeLower,BorrowerAPR,BorrowerRate,LoanOriginalAmount,MonthlyLoanPayment,Term,PercentFunded) %>%
  gather(Measure, value, -CreditScoreRangeLower)

ggplot(plot_data, aes(x = CreditScoreRangeLower, y=value)) +
  stat_binhex() +
  geom_smooth(formula = y~x) + 
  facet_wrap(~Measure, scales="free")

CreditScoreRangeUpper

plot_data <- relevant_data %>%
  select(CreditScoreRangeUpper,BorrowerAPR,BorrowerRate,LoanOriginalAmount,MonthlyLoanPayment,Term,PercentFunded) %>%
  gather(Measure, value, -CreditScoreRangeUpper)

ggplot(plot_data, aes(x = CreditScoreRangeUpper, y=value)) +
  stat_binhex() +
  geom_smooth(formula = y~x) + 
  facet_wrap(~Measure, scales="free")

FirstRecordedCreditLine

plot_data <- relevant_data %>%
  select(FirstRecordedCreditLine,BorrowerAPR,BorrowerRate,LoanOriginalAmount,MonthlyLoanPayment,Term,PercentFunded) %>%
  gather(Measure, value, -FirstRecordedCreditLine)

ggplot(plot_data, aes(x = FirstRecordedCreditLine, y=value)) +
  stat_binhex() +
  geom_smooth(formula = y~x) + 
  facet_wrap(~Measure, scales="free")

OpenRevolvingAccounts

plot_data <- relevant_data %>%
  select(OpenRevolvingAccounts,BorrowerAPR,BorrowerRate,LoanOriginalAmount,MonthlyLoanPayment,Term,PercentFunded) %>%
  gather(Measure, value, -OpenRevolvingAccounts)

ggplot(plot_data, aes(x = OpenRevolvingAccounts, y=value)) +
  stat_binhex() +
  geom_smooth(formula = y~x) + 
  facet_wrap(~Measure, scales="free")

InquiriesLast6Months

plot_data <- relevant_data %>%
  select(InquiriesLast6Months,BorrowerAPR,BorrowerRate,LoanOriginalAmount,MonthlyLoanPayment,Term,PercentFunded) %>%
  gather(Measure, value, -InquiriesLast6Months)

ggplot(plot_data, aes(x = InquiriesLast6Months, y=value)) +
  stat_binhex() +
  geom_smooth(formula = y~x) + 
  facet_wrap(~Measure, scales="free")

AmountDelinquent

plot_data <- relevant_data %>%
  select(AmountDelinquent,BorrowerAPR,BorrowerRate,LoanOriginalAmount,MonthlyLoanPayment,Term,PercentFunded) %>%
  gather(Measure, value, -AmountDelinquent)

ggplot(plot_data, aes(x = AmountDelinquent, y=value)) +
  stat_binhex() +
  geom_smooth(formula = y~x) + 
  facet_wrap(~Measure, scales="free")

DelinquenciesLast7Years

plot_data <- relevant_data %>%
  select(DelinquenciesLast7Years,BorrowerAPR,BorrowerRate,LoanOriginalAmount,MonthlyLoanPayment,Term,PercentFunded) %>%
  gather(Measure, value, -DelinquenciesLast7Years)

ggplot(plot_data, aes(x = DelinquenciesLast7Years, y=value)) +
  stat_binhex() +
  geom_smooth(formula = y~x) + 
  facet_wrap(~Measure, scales="free")

RevolvingCreditBalance

plot_data <- relevant_data %>%
  select(RevolvingCreditBalance,BorrowerAPR,BorrowerRate,LoanOriginalAmount,MonthlyLoanPayment,Term,PercentFunded) %>%
  gather(Measure, value, -RevolvingCreditBalance)

ggplot(plot_data, aes(x = RevolvingCreditBalance, y=value)) +
  stat_binhex() +
  geom_smooth(formula = y~x) + 
  facet_wrap(~Measure, scales="free")

BankcardUtilization

plot_data <- relevant_data %>%
  select(BankcardUtilization,BorrowerAPR,BorrowerRate,LoanOriginalAmount,MonthlyLoanPayment,Term,PercentFunded) %>%
  gather(Measure, value, -BankcardUtilization)

ggplot(plot_data, aes(x = BankcardUtilization, y=value)) +
  stat_binhex() +
  geom_smooth(formula = y~x) + 
  facet_wrap(~Measure, scales="free")

DebtToIncomeRatio

plot_data <- relevant_data %>%
  select(DebtToIncomeRatio,BorrowerAPR,BorrowerRate,LoanOriginalAmount,MonthlyLoanPayment,Term,PercentFunded) %>%
  gather(Measure, value, -DebtToIncomeRatio)

ggplot(plot_data, aes(x = DebtToIncomeRatio, y=value)) +
  stat_binhex() +
  geom_smooth(formula = y~x) + 
  facet_wrap(~Measure, scales="free")

IncomeVerifiable

plot_data <- relevant_data %>%
  select(IncomeVerifiable,BorrowerAPR,BorrowerRate,LoanOriginalAmount,MonthlyLoanPayment,Term,PercentFunded) %>%
  gather(Measure, value, -IncomeVerifiable)

ggplot(plot_data, aes(x = IncomeVerifiable, y=value)) +
  stat_binhex() +
  geom_smooth(formula = y~x) + 
  facet_wrap(~Measure, scales="free")

TotalTrades

plot_data <- relevant_data %>%
  select(TotalTrades,BorrowerAPR,BorrowerRate,LoanOriginalAmount,MonthlyLoanPayment,Term,PercentFunded) %>%
  gather(Measure, value, -TotalTrades)

ggplot(plot_data, aes(x = TotalTrades, y=value)) +
  stat_binhex() +
  geom_smooth(formula = y~x) + 
  facet_wrap(~Measure, scales="free")

TradesNeverDelinquent

plot_data <- relevant_data %>%
  select(TradesNeverDelinquent.per,BorrowerAPR,BorrowerRate,LoanOriginalAmount,MonthlyLoanPayment,Term,PercentFunded) %>%
  gather(Measure, value, -TradesNeverDelinquent.per)

ggplot(plot_data, aes(x = TradesNeverDelinquent.per, y=value)) +
  stat_binhex() +
  geom_smooth(formula = y~x) + 
  facet_wrap(~Measure, scales="free")

Other Prosper Data vs. Actual Profit

TotalProsperLoans

plot_data <- relevant_data %>%
  select(TotalProsperLoans,BorrowerAPR,BorrowerRate,LoanOriginalAmount,MonthlyLoanPayment,Term,PercentFunded) %>%
  gather(Measure, value, -TotalProsperLoans)

ggplot(plot_data, aes(x = factor(TotalProsperLoans), y=value)) +
  geom_boxplot() +
  stat_summary(fun.data = mean_cl_normal) + 
  facet_grid(Measure ~ ., scales="free")

OnTimeProsperPayments

plot_data <- relevant_data %>%
  select(OnTimeProsperPayments,BorrowerAPR,BorrowerRate,LoanOriginalAmount,MonthlyLoanPayment,Term,PercentFunded) %>%
  gather(Measure, value, -OnTimeProsperPayments)

ggplot(plot_data, aes(x = OnTimeProsperPayments, y=value)) +
  stat_binhex() +
  geom_smooth(formula = y~x) + 
  facet_wrap(~Measure, scales="free")

ProsperPrincipalOutstanding

plot_data <- relevant_data %>%
  select(ProsperPrincipalOutstanding,BorrowerAPR,BorrowerRate,LoanOriginalAmount,MonthlyLoanPayment,Term,PercentFunded) %>%
  gather(Measure, value, -ProsperPrincipalOutstanding)

ggplot(plot_data, aes(x = ProsperPrincipalOutstanding, y=value)) +
  stat_binhex() +
  geom_smooth(formula = y~x) + 
  facet_wrap(~Measure, scales="free")

Recommendations

plot_data <- relevant_data %>%
  select(Recommendations,BorrowerAPR,BorrowerRate,LoanOriginalAmount,MonthlyLoanPayment,Term,PercentFunded) %>%
  gather(Measure, value, -Recommendations)

ggplot(plot_data, aes(x = Recommendations, y=value)) +
  stat_binhex() +
  geom_smooth(formula = y~x) + 
  facet_wrap(~Measure, scales="free")

InvestmentFromFriendsCount

plot_data <- relevant_data %>%
  select(InvestmentFromFriendsCount,BorrowerAPR,BorrowerRate,LoanOriginalAmount,MonthlyLoanPayment,Term,PercentFunded) %>%
  gather(Measure, value, -InvestmentFromFriendsCount)

ggplot(plot_data, aes(x = InvestmentFromFriendsCount, y=value)) +
  stat_binhex() +
  geom_smooth(formula = y~x) + 
  facet_wrap(~Measure, scales="free")

InvestmentFromFriendsAmount

plot_data <- relevant_data %>%
  select(InvestmentFromFriendsAmount,BorrowerAPR,BorrowerRate,LoanOriginalAmount,MonthlyLoanPayment,Term,PercentFunded) %>%
  gather(Measure, value, -InvestmentFromFriendsAmount)

ggplot(plot_data, aes(x = InvestmentFromFriendsAmount, y=value)) +
  stat_binhex() +
  geom_smooth(formula = y~x) + 
  facet_wrap(~Measure, scales="free")

LoanOriginationQuarter

plot_data <- relevant_data %>% 
  select(LoanOriginationQuarter,BorrowerAPR,BorrowerRate,LoanOriginalAmount,MonthlyLoanPayment,Term,PercentFunded) %>%
  gather(Measure, value, -LoanOriginationQuarter)

ggplot(plot_data, aes(x = LoanOriginationQuarter, y=value)) +
  geom_col(stat = "identity") +
  theme(axis.text.x = element_text(angle = 90, hjust = 1)) + 
  facet_grid(Measure ~ ., scales="free")

Final Plots and Summary

What is notable above is that in each graph where there is a noticeable relationship between profit predictors and profit measures, the Prosper rating is inversely correlated with the profit measures. What is also notable is that there is a consistent relationship between lender yield, and lender loss: the more the lender stands to gain, the more they stand to lose. I look at this in more detail below. What is also notable is that estimated effective yield is always a bit less than both the estimated yield, reflecting also the estimated loss.

Assuming that the various profit measures, which may reflect only profit for clients/lenders, rather than for the company itself, are in fact what we want to be looking at, it is possible to notice certain trends which may be worth looking at more closely.

Further, assuming that lenders also care about potential missed payments, particularly if this would put them in a financial bind, it is worth looking at strong demographic predictors of delinquency, which does not appear to be reflected in the Prosper rating.

Lender yield by estimated loss

Lender profit by number of open revolving accounts

Lender profit by loan origination quarter

Reflection

First, I encountered a fair bit of trouble interpreting the data without any background story. Googling around for info on Prosper loans online, I was able to get a general idea of what the company was doing, which made interpreting the data somewhat easier.

At this point, it is still difficult to say much about this data without knowing, in a lot more detail: what realities the less obvious measures reflect; the story behind the data; how certain measures are gathered and determined; and how the various measures reflect on both profit for the company, and profit for the clients. What would be needed is to take a much closer and more in-depth look at what the company does, what purpose the data serves, and how the measures were collected and what they reflect.